Devin AI Reviews & Product Details

Top-Rated Alternatives

HubSpot Marketing Hub

View All Alternatives

Devin AI Integrations

(1)

Integration information sourced from real user reviews.

G2 reviews are authentic and verified.

Here's how.

SK

Sree K.

Software Engineer II in Test

Information Technology and Services

Mid-Market (51-1000 emp.)

1/20/2026

"End-to-End Test Automation on Autopilot with Devin"

5/5

What do you like best about Devin AI?

For me, the absolute best thing about Devin is its complete autonomy and the way it handles the entire end-to-end automation process without needing me to babysit it. I can simply drop in a link to the test suite from Azure DevOps, and it takes it from there. It logs into the application, finds the UI elements, and writes the Java code in the local Eclipse setup we have on its machine. The fact that it can run the tests and keep healing the script until it passes is a massive time-saver. I can have five different sessions running in parallel, which means I’m getting a whole week’s worth of manual automation done in a single day.

Ease of use is genuinely high because it’s mostly just natural-language prompting. I don’t have to write code snippets the way I do with other ai tools; I just explain the logic and it does the rest. Implementation was a bit more of a project, though, because setting up the dedicated machine with Eclipse and the right paths for our Azure Git repo took some time. Once that was done, everything has been smooth. The integration with Azure DevOps is surprisingly good as well, since it has a native way to handle those connections through the secrets manager and PAT.

I use Devin almost every single day now for any new test case development. The feature set is impressive, especially how it creates its own computing environment and uses its own browser to analyze the UI. It feels more like an actual teammate than just a tool. Customer support has been fairly responsive when I’ve hit those weird ACU consumption bugs, although most of the time I can figure things out from the logs Devin provides.

Like I mentioned, it’s not perfect. Sometimes it gets overexcited and changes core framework methods, which is something I have to watch out for in every PR. And that deviation after 50 ACU is definitely annoying, because it starts to ignore the initial logic. Still, as a tester who wants to scale up automation quickly, these feel like small prices to pay for the amount of work it gets done. It has completely changed how I manage my sprint tasks. Review collected by and hosted on G2.com.

What do you dislike about Devin AI?

It keeps messing with things it shouldn’t touch. There have been several times when it decided to refactor our core pre-built methods in the automation framework, even though it was only supposed to write a simple test script. That’s frustrating because I then have to spend extra time during PR review double-checking that it didn’t break some global logic that all our other tests depend on. It’s like it gets overexcited and tries to be too helpful, but it ends up creating more work for me to verify.

The other major issue is how it starts to deviate after a long session. I’ve noticed that once the ACU consumption hits around 40 or 50, Devin really starts to lose the plot. It begins ignoring the initial instructions I gave it, and the logic starts drifting in weird directions. It feels like the model gets tired and forgets the original goal of the session. I usually have to kill the session and start a completely fresh one just to get it back to being productive, which is a bit of a waste of time.

I also find the initial setup for the dedicated machine and secrets a bit tedious. Since it doesn’t have direct access to Azure DevOps, I have to manage all the creds and PATs as secrets inside Devin, which is just another thing to keep track of. And while it’s impressive that it can run Eclipse locally and debug its own code, execution speed can sometimes be slow compared to a human just running the script. Overall, it’s a great tool, but the overreaching code changes and the reliability issues in long sessions are definitely the biggest downsides for me. Review collected by and hosted on G2.com.

What problems is Devin AI solving and how is that benefiting you?

The biggest problem Devin solves is the manual bottleneck of writing repetitive automation code from scratch. Before Devin, I would spend hours analyzing UI elements, writing locators, and manually building out the Java files in Eclipse. Now Devin takes care of that tedious discovery work. It also reduces debugging fatigue thanks to its “self healing” capability: it runs the test, identifies the failure, and iterates on its own code until the test passes. For a tester with a lot on their plate, not having to hunt for missing semicolons or broken XPath locators is a massive relief.

It basically acts as a force multiplier for me. Because I can run multiple Devin sessions in parallel, I can work on automating an entire sprint’s worth of stories at the same time. While I’m thinking through the high-level logic for one scenario, Devin is already finishing the code for three others. It shifts my role from being a “coder” to being more of an “architect”: I provide the intent and the credentials, and Devin does the heavy lifting on implementation. As a result, we can ship our automation suites much faster, and I can spend more time on complex testing strategies instead of writing boilerplate.

The fact that it can work directly within our Azure Git repo and handle the PR process is huge. Even though it doesn’t have direct access to ADO, the way it uses the secrets I provide to log in and read the test cases still makes it feel tightly integrated into our workflow. It’s also a big benefit that it uses the exact same Eclipse setup we have locally, because the code it generates is already compatible with our environment. Overall, it bridges the gap between a manual test case in ADO and a finished, running automation script without me having to act as the middleman. Review collected by and hosted on G2.com.