Custom tooling for the test stack management

Summary

Azimo's engineering team developed a custom tool called AutomationTestSupervisor to manage their mobile app testing stack, significantly improving test efficiency and app quality.

Abstract

Azimo has made substantial progress in their mobile app testing processes over three years, evolving from no tests to achieving 60% unit test coverage and implementing hundreds of functional and UI tests that run automatically. This transformation was accomplished by a small team and led to a shift from bi-monthly to bi-weekly app releases, with a notable increase in app stability to 99% crash-free users. However, the growth in test complexity necessitated the creation of a tailored solution, as existing tools like Fastlane and Spoon became inadequate. The team's custom-built AutomationTestSupervisor, written in Python, provided enhanced control over the test cycle, including logging, test package management, sharding, re-running failed tests, and device management. This tool reduced testing time by approximately 50%, enabled parallel testing on multiple devices, and facilitated easier debugging and maintenance. The shift to a 1-week release train, made possible by these improvements, has been a significant advancement in Azimo's product development process.

Opinions

The team recognized the limitations of existing testing tools and the need for a more tailored solution to address their specific challenges.
Building a custom tool from scratch was seen as a necessary step to gain full control over their testing environment and to overcome the constraints of their previous setup.
The transition to Python for their AutomationTestSupervisor was strategic, aligning with their existing tech stack and eliminating the need for Ruby, which was not a core competency within the team.
The team values the ability to quickly iterate and release new app versions, emphasizing the importance of speedy experimentation and feedback loops in their development philosophy.
Despite the challenges of maintaining a custom tool, the team believes the benefits, such as reduced testing time and improved debugging capabilities, outweigh the costs.
The team is committed to continuous improvement, as evidenced by their ongoing efforts to refine their testing processes and the anticipation of future enhancements in product development.

The evolution of apps QA at Azimo

This is the third in a series of blog posts in which we outline our multiple years’ experience with our Android app testing at Azimo. Most of the principles, goals, and achievements also apply to our iOS app.

Table of content

After around three years of development, we improved our testing stack and app’s quality significantly. From 0 tests to about 60% unit tests coverage, from manual QA testing to hundreds of functional and UI tests run automatically, from one release per two months to the new app version published in the store every two weeks. App’s stability went up from countless crashes to 99% of crash-free users. We achieved all of that with a relatively small team of 3–4 software engineers and 1 QA engineer, separately on Android and iOS. Changes were introduced in small steps, quarter by quarter, thanks to conscious goals management. We had never frozen product development for more than 1–2 weeks.

While the results of our work were quite impressive, we also started facing new challenges. With hundreds of functional and UI tests, the tools we used (Fastlane with our custom plugin, Spoon, and others) weren’t well suited for our needs anymore. Here are some of the problems we faced:

Our test suit was overgrown, to say the least. Because of that, it wasn’t possible to run all the tests at once. A single run would last for more than 5 hours. And due to big flakiness, it was nearly impossible to make all tests passing. Tests sharding solutions provided by Android didn’t work very well for our needs (e.g., by not balancing the number of tests properly).

Our testing stack was hard to debug — the test suite results, test runners logic, ADB communication, or AVDs management. We even defined some specific requirements for improving these. But due to lack of competencies (e.g., no Ruby language engineers), in many cases, we could rely on the community’s help and responses to the issues we reported.

Custom tooling for the test stack management

The evolution of apps QA at Azimo

Automation Tests Supervisor

Story behind AutomationTestSupervisor — our custom made tool for Android automation tests

AutomationTestSupervisor is a Python tool that is capable of creating, running Android Virtual Devices, building and…