Alumnium translates plain English test instructions into browser commands. You write what you want the test to do using three commands: 'do' for actions, 'check' for verifications, and 'get' for data extraction. The AI figures out how to execute those instructions by analyzing the application's accessibility tree and taking screenshots. It works with Playwright, Selenium, and Appium, so it fits into existing test infrastructure rather than replacing it.
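A minimal sketch of what those three commands look like in practice, assuming Alumnium's Python API (an `Alumni` wrapper with `do`, `check`, and `get` methods, per the project's docs) and a Selenium Chrome driver; the TodoMVC URL is purely illustrative:

```python
# Illustrative sketch: Alumnium wrapping an existing Selenium driver.
# Requires the `alumnium` package and an AI provider API key configured;
# class and method names (Alumni, do/check/get) follow the project's docs.
from selenium.webdriver import Chrome
from alumnium import Alumni

driver = Chrome()
driver.get("https://todomvc.com/examples/vue/")

al = Alumni(driver)                            # wraps the existing driver
al.do("add 'buy milk' to the list")            # action
al.check("'buy milk' is present in the list")  # verification
count = al.get("number of items in the list")  # data extraction

driver.quit()
```

Because Alumnium wraps the driver rather than replacing it, ordinary Selenium calls can be mixed in alongside the natural-language steps.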
A test automation engineer at a fintech company needs to verify transaction flows across web and mobile apps. He writes "do enter 500 in the amount field" and "check that confirmation message appears." Alumnium handles both platforms through the same natural language interface. When the product team adds a currency selector dropdown, he just writes "do select USD from currency" without diving into mobile element locators or web DOM structures.
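Inside a test framework, those two instructions might look like this. This is a hypothetical pytest sketch: the fixture name, URL, and test name are mine, while `Alumni` and its `do`/`check` methods come from Alumnium's documentation:

```python
# Hypothetical pytest wiring for the transfer flow described above.
# The fixture, placeholder URL, and test name are illustrative only.
import pytest
from selenium.webdriver import Chrome
from alumnium import Alumni

@pytest.fixture
def al():
    driver = Chrome()
    driver.get("https://bank.example.com/transfer")  # placeholder URL
    yield Alumni(driver)
    driver.quit()

def test_transfer_confirmation(al):
    al.do("enter 500 in the amount field")
    al.check("confirmation message appears")
```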
The philosophy differs from AI tools that generate entire test suites. Alumnium does not write tests. Engineers still design test logic, structure, and assertions. It just handles the tedious part—telling browsers what to click and where to type. A senior test engineer keeps control over test architecture while junior team members can contribute test steps without mastering complex selector strategies.
Support for reasoning models improves accuracy when instructions get ambiguous. The AI can better distinguish between similar buttons or fields when multiple matches exist. Integration with Model Context Protocol means general-purpose AI agents can trigger test runs or extract test data as part of broader workflows.
Alumnium currently works with Python test frameworks. JavaScript and Ruby support remains in development, which blocks teams standardized on those languages. A Ruby on Rails shop running RSpec would need to wait or restructure its testing approach. TypeScript support exists, though the documentation doesn't specify which frameworks are covered.
Version 0.17 appears to be free, with Alumnium Pro mentioned as a future paid tier. There are no details on what Pro includes or when it launches.
The MCP Server integration matters for teams using AI-assisted development workflows. A developer using an AI coding assistant could theoretically trigger Alumnium tests directly from their editor, though this requires the assistant to support Model Context Protocol.
Where it breaks: complex conditional logic still needs traditional code. A test that branches on user state or performs calculations can't be expressed entirely in natural language. Alumnium supports models from Anthropic, Google, Meta, and OpenAI, but switching providers might change interpretation consistency: an instruction that works reliably with one AI provider could behave differently with another.
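In practice that means the branch lives in ordinary Python, with Alumnium executing only the individual steps. A hedged sketch, where the balance check and both instruction strings are hypothetical and `al` is an `Alumni` instance as in Alumnium's docs:

```python
# Hypothetical sketch: conditional logic stays in code, Alumnium runs the steps.
from selenium.webdriver import Chrome
from alumnium import Alumni

al = Alumni(Chrome())  # navigation to the transfer page omitted

# 'get' pulls data out; the comparison and branching are plain Python.
balance = float(al.get("current account balance"))

if balance >= 500:
    al.do("enter 500 in the amount field")
    al.check("confirmation message appears")
else:
    al.do("enter the full balance in the amount field")
    al.check("insufficient funds warning appears")
```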
Teams fully invested in codeless testing platforms won't find what they need here. This requires writing code and understanding test framework structure. A business analyst creating smoke tests through record-and-playback tools would hit a learning curve.
Don't use this if your team runs tests exclusively in JavaScript or Ruby right now. Wait for language support. Skip it if your organization prohibits sending application screenshots to AI providers for privacy reasons—Alumnium needs visual context to interpret instructions.