Performance testing is a form of software testing that focuses on how a system running the system performs under a particular load. This is not about finding software bugs or defects. Performance testing measures according to benchmarks and standards. Performance testing should give developers the diagnostic information they need to eliminate bottlenecks.
Types of performance testing for software
To understand how software will perform on users’ systems, there different types of performance tests that can be applied during software testing. This is non-functional testing, which is designed to determine the readiness of a system. (Functional testing focuses on individual functions of software.)
Image credit MindsMapped
Load testing measures system performance as the workload increases. That workload could mean concurrent users or transactions.The system is monitored to measure response time and system staying power as workload increases. That workload falls within the parameters of normal working conditions.
Unlike load testing, stress testing — also known as fatigue testing — is meant to measure system performance outside of the parameters of normal working conditions. The software is given more users or transactions that can be handled. The goal of stress testing is to measure the software stability. At what point does software fail, and how does the software recover from failure?
Spike testing is a type of stress testing that evaluates software performance when workloads are substantially increased quickly and repeatedly. The workload is beyond normal expectations for short amounts of time.
Endurance testing — also known as soak testing — is an evaluation of how software performs with a normal workload over an extended amount of time. The goal of endurance testing is to check for system problems such as memory leaks. (A memory leak occurs when a system fails to release discarded memory. The memory leak can impair system performance or cause it to fail.)
Scalability testing is used to determine if software is effectively handling increasing workloads. This can be determined by gradually adding to the user load or data volume while monitoring system performance. Also, the workload may stay at the same level while resources such as CPUs and memory are changed.
Volume testing determines how efficiently software performs with a large, projected amounts of data. It is also known as flood testing because the test floods the system with data.
Most Common Problems Observed in Performance Testing
During performance testing of software, developers are looking for performance symptoms and issues. Speed issues — slow responses and long load times for example — often are observed and addressed. But there are other performance problems that can be observed:
- Bottlenecking — This occurs when data flow is interrupted or halted because there is not enough capacity to handle the workload.
- Poor scalability — If software cannot handle the desired number of concurrent tasks, results could be delayed, errors could increase, or other unexpected behavior could happen that affects:
- Disk usage
- CPU usage
- Memory leaks
- Operating system limitations
- Poor network configuration
- Software configuration issues — Often settings are not set at a sufficient level to handle the workload.
- Insufficient hardware resources — Performance testing may reveal physical memory constraints or low-performing CPUs.
Seven Performance Testing Steps
Image credit Gateway TestLabs
Also known as the test bed, a testing environment is where software, hardware, and networks are set up to execute performance tests. To use a testing environment for performance testing, developers can use these seven steps:
1. Identify the testing environment.
Identify the hardware, software, network configurations and tools available allows the testing team design the test and identify performance testing challenges early. Performance testing environment options include:
- Subset of production system with fewer servers of lower specification
- Subset of production system with fewer servers of the same specification
- Replica of productions system
- Actual production system
2. Identify performance metrics.
In addition to identifying metrics such as response time, throughput and constraints, identify what are the success criteria for performance testing.
3. Plan and design performance tests.
Identify performance test scenarios that take into account user variability, test data, and target metrics. This will create one or two models.
4. Configure the test environment.
Prepare the elements of the test environment and instruments needed to monitor resources.
5. Implement your test design.
Develop the tests.
6. Execute tests.
In addition to running the performance tests, monitor and capture the data generated.
7. Analyze, report, retest.
Analyze the data and share the findings. Run the performance tests again using the same parameters and different parameters.
What Performance Testing Metrics are Measured
Metrics are needed to understand the quality and effectiveness of performance testing. Improvements cannot be made unless there are measurements. There are two definitions that need to be explained:
- Measurements — The data being collected such as the seconds it takes to respond to a request.
- Metrics — A calculation that uses measurements to define the quality of results such as average response time (total response time/requests).
There are many ways to measure speed, scalability, and stability but each round of performance testing cannot be expected to use all of them. Among the metrics used in performance testing, the following often are used:
Total time to send a request and get a response.
Also known as average latency, this tells developers how long it takes to receive the first byte after a request is sent.
Average load time
The average amount of time it takes to deliver every request is a major indicator of quality from a user’s perspective.
Peak response time
This is the measurement of the longest amount of time it takes to fulfill a request. A peak response time that is significantly longer than average may indicate an anomaly that will create problems.
This calculation is a percentage of requests resulting in errors compared to all requests. These errors usually occur when the load exceeds capacity.
This the most common measure of load — how many active users at any point. Also known as load size.
Requests per second
How many requests are handled.
A measurement of the total numbers of successful or unsuccessful requests.
Measured by kilobytes per second, throughput shows the amount of bandwidth used during the test.
How much time the CPU needs to process requests.
How much memory is needed to process the request.
Performance Testing Best Practices
Perhaps the most important tip for performance testing is testing early, test often. A single test will not tell developers all they need to know. Successful performance testing is a collection of repeated and smaller tests:
- Test as early as possible in development. Do not wait and rush performance testing as the project winds down.
- Performance testing isn’t just for completed projects. There is value in testing individual units or modules.
- Conduct multiple performance tests to ensure consistent findings and determine metrics averages.
- Applications often involve multiple systems such as databases, servers, and services. Test the individual units separately as well as together.
Image credit Varun Kapaganty
In addition to repeated testing, performance testing will be more successful by following a series of performance testing best practices:
- Involve developers, IT and testers in creating a performance testing environment.
- Remember real people will be using the software that is undergoing performance testing. Determine how the results will affect users not just test environment servers.
- Go beyond performance test parameters. Develop a model by planning a test environment that takes into account as much user activity as possible.
- Baseline measurements provide a starting point for determining success or failure.
- Performance tests are best conducted in test environments that are as close to the production systems as possible.
- Isolate the performance test environment from the environment used for quality assurance testing.
- No performance testing tool will do everything needed. And limited resources may restrict choice even further. Research performance testing tools for the right fit.
- Keep the test environment as consistent as possible.
- Calculating averages will deliver actionable metrics. There is value in tracking outliers also. Those extreme measurements could reveal possible failures.
- Consider the audience when preparing reports that share performance testing findings. Also, include any system and software changes in reports.
Five common performance testing mistakes
There are also some mistakes that can lead to less-than-reliable results when performance testing:
- Not enough time for testing.
- Not involving developers.
- Not using QA system similar to production system.
- Not sufficiently tuning software.
- Not having a troubleshooting plan.
Performance Testing Fallacies
Performance testing fallacies can lead to mistakes or failure to follow performance testing best practices. According to Sofia Palamarchuk, these beliefs can cost significant money and resources when developing software:
Performance testing is the last step in development.
As mentioned in the section on performance testing best practices, anticipating and solving performance issues should be an early part of software development. Implementing solutions early will less costly than major fixes at the end of software development.
More hardware can fix performance issues.
Adding processors, servers or memory simply adds to the cost without solving any problems. More efficient software will run better and avoid potential problems that can occur even when hardware is increased or upgraded.
The testing environment is close enough.
Conducting performance testing in a test environment that is similar to the production environment is a performance testing best practice for a reason. The differences between the elements can significantly affect system performance. It may not be possible to conduct performance testing in the exact production environment, but try to match:
- Hardware components
- Operating system and settings
- Other applications used on the system
What works now, works across the board.
Be careful about extrapolating results. Don’t take the small set of performance testing results and assume that they will be the same when elements change. Also, it works in the opposite direction. Do not infer minimum performance and requirements based upon load testing. All assumptions should be verified through performance testing.
One performance testing scenario is enough.
Not every performance problem can be detected in one performance testing scenario. But resources do limit the amount of testing that can happen. In the middle are a series of performance tests that target the riskiest situations and have the greatest impact on performance. Also, problems can arise outside of well-planned and well-designed performance testing. Monitoring the production environment also can detect performance issues.
Testing each part equals testing the whole system.
While it is important to isolate functions for performance testing, the individual component test results do not add up to a system-wide assessment. But it may not be feasible to test all the functionalities of a system. A complete-as-possible performance test must be designed using the resources available. But be aware of what has not been tested.
What works for them, works for us.
If a given set of users does experience complications or performance issues, do not consider that a performance test for all users. Use performance testing to make sure the platform and configurations work as expected.
Software developers are too experienced to need performance testing.
Lack of experience is not the only reason behind performance issues. Mistakes are made — even by developers who have created issue-free software in the past. Many more variables come into play — especially when multiple concurrent users are in the system.
A full load test tells everything.
Test scripts are actual users.
Make sure the test automations are using the software in ways that real users would. This is especially important when performance test parameters are changed.
Additional Resources on Performance Testing
- Performance Testing Guidance for Web Applications (Microsoft)
- Performance Testing Checklist Packet (SmartBear PDF)
- “Perfect Software and Other Illusions About Testing” (Book by Jerry Weinberg)
- Top Load Testing Tools: 50 Useful Tools for Load Testing Websites, Apps, and More (Stackify)
- The Tester (Special Interest Group In Software Testing (SIGIST) newsletter)
- Software Performance Testing (TechTarget)
- Looking for New Relic Alternatives & Competitors? Learn Why Developers Pick Retrace - May 1, 2019
- Better code, Better results: No BS [Webinar] - December 20, 2018
- New Feature in Retrace: Application Scoring - May 24, 2018
- What’s New in Retrace: Java on Windows Support Now Available - March 20, 2018
- SST Software Fixes Problems behind the Scenes with Retrace - February 1, 2018