Wow, so many questions!
Let's take a step back for a moment.
Why do we write tests? Any tests, of any kind?
The team spends time writing tests in order to save time.
Suppose a bug report is filed, and we need to diagnose
whether the defect is in module A, B, or C. (I'm being simplistic;
sometimes there's greater subtlety to it than that.)
If A & B have such good unit test coverage that we
feel confident in their correctness, we can triage
the issue by focusing our efforts on how C might
interact with the buggy behavior.
Suppose a change to an existing feature, or the addition of a new one,
motivates refactoring our code to accommodate recently discovered needs.
How do I know my refactor didn't break existing functionality?
Test coverage gives us the courage to wade in, make changes,
and still be confident the old stuff will keep on ticking.
There's a thread that runs through these:
coverage.
You should measure your current test coverage,
and let that play a factor in deciding which tests to write next.
Ok, having reflected for a moment on what we hope
to get out of them, let's look at the process of writing tests.
If you're thinking about it before writing a new feature,
I claim you can "get your tests for free!".
That is, you know you're going to have to execute
that line of target code you wrote for the new feature.
You could do it interactively.
Or you could write an automated test which exercises
the target code and verifies some sensible result.
This initially looks a lot like traditional "printf debugging",
but once the target code is doing the Right Thing
the test can lock in the observed result and you
get your test "for free".
Months down the road, future maintenance engineers
will thank you when they have to refactor that code.
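A minimal sketch of what "locking in" an observed result looks like, with the module, function, and expected value all invented for illustration:

    # test_pricing.py -- a characterization test that locks in a result
    # first verified interactively (all names here are hypothetical).
    from pricing import quote_total

    def test_quote_total_happy_path():
        # Verified by hand once; now any refactor that changes this
        # result fails loudly instead of silently.
        assert quote_total(items=3, unit_price=10) == 30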
Back to measuring coverage.
Not all target code in your repo was written in that way.
So now you want to retrofit tests on top of it. Start with a measurement,
e.g. $ pytest --cov=. --cov-report=term-missing (this needs the pytest-cov plugin).
Or use poor man's profiling: set a debugger breakpoint
in the method of interest, run the tests, and see whether
the breakpoint gets hit.
If not, write a covering test,
and verify the breakpoint triggers.
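In Python that poor man's check can be as simple as dropping a breakpoint() into the method you care about (the method here is made up):

    # Target code, temporarily instrumented (hypothetical method name).
    def parse_header(raw: bytes) -> dict:
        breakpoint()   # run pytest: if no test drops you into the debugger
                       # here, nothing in the suite covers this method yet
        fields = raw.decode().split(";")
        return dict(f.split("=", 1) for f in fields)

Once a test lands you in the debugger at that line, delete the breakpoint and keep the test.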
But wait, you say.
How do I know whether to write a unit test or
an integration test?
It doesn't matter.
Just arrange for an automated test to exercise
the source line of interest.
If the API offers convenient access to
the guts of the code, letting you call
a tiny helper directly, then great!
You have a tiny unit test.
If the API makes it "hard" to do that,
you might wind up with a big integration
test which takes seconds to run and
which pushes several frames onto the stack
by the time it hits that line of code.
Sigh!
You have an integration test.
Score a win anyway.
Then write a ticket to change the target code's
Public API so it is easier to directly test,
and prioritize it in a subsequent sprint.
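Here's a sketch of that contrast, with invented names, where both tests end up covering the same target line:

    # Two ways to cover the same target line (all names hypothetical).
    from report.totals import normalize_currency   # a small internal helper
    from report import run_report                  # the public entry point

    # Tiny unit test: the helper happens to be importable, so call it directly.
    def test_normalize_currency_strips_symbol():
        assert normalize_currency("$1,200") == 1200

    # Big integration test: several frames deep by the time it reaches the
    # same line inside normalize_currency, and it needs a file on disk.
    def test_run_report_totals_column(tmp_path):
        csv_file = tmp_path / "sales.csv"
        csv_file.write_text("amount\n$1,200\n$800\n")
        result = run_report(str(csv_file))
        assert result["total"] == 2000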
If you can cover the Happy Path for much of the
code you care about, you're probably ahead of the game.
Folks can spend a lot of time trying to move
from 90% coverage to 95% coverage,
exercising all those error handlers.
Usually you will see diminishing returns from such effort.
The important lesson to learn is that the Public API
should be designed with all consumers in mind,
both target code functions and test code functions.
You might be able to refactor, to improve testability.
More likely you will learn what traps to avoid
next time you're implementing some new feature
that will need testing.
- Is it a good strategy ...
I claim the most important thing to do,
for a codebase with near zero coverage,
is to write a couple of integration tests
that exercise a fair bit of target code,
even if stack depth gets a bit deep.
Any exercised line at least has an opportunity
to throw a fatal error in CI/CD, so if someone
breaks it with, say, an unconditional divide-by-zero,
we will know about it quickly.
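Such a test can be deliberately coarse, something like this sketch (the entry point, flag, and output are all invented):

    # A coarse smoke test: run the whole pipeline on a tiny input and
    # assert only the broadest facts (all names hypothetical).
    from myapp.cli import main

    def test_end_to_end_smoke(tmp_path, capsys):
        infile = tmp_path / "input.txt"
        infile.write_text("hello world\n")
        exit_code = main(["--count-words", str(infile)])
        out = capsys.readouterr().out
        assert exit_code == 0
        assert "2" in out   # two words counted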
Once your code is largely covered,
you can come back and write more narrowly focused unit tests.
Often you will need DI (dependency injection) and similar refactors,
i.e. changes to your Public API,
to make the target code unit testable.
Pay attention to where most bug reports are coming from,
and let that prioritize your refactors.
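As a small example of the kind of DI refactor meant here (names invented): a function that used to fetch the current time internally now accepts it as a parameter, which makes its behavior deterministic under test.

    # Hypothetical DI refactor: the clock is injected, so a unit test
    # can pin "now" to a known value; production callers omit it.
    from datetime import datetime, timezone

    def is_expired(deadline: datetime, now=None) -> bool:
        now = now or datetime.now(timezone.utc)
        return now > deadline

    def test_is_expired_just_past_deadline():
        deadline = datetime(2024, 1, 1, tzinfo=timezone.utc)
        later = datetime(2024, 1, 2, tzinfo=timezone.utc)
        assert is_expired(deadline, now=later) is True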
- How do you approach writing integration tests?
Write a Happy Path test for some input data
that a stakeholder cares about.
If you have more time, maybe try to exercise error handlers.
But often that effort will be better spent writing
narrowly focused unit tests that exercise error handlers.
Integration tests can be written to verify documented
failure modes, e.g. deliberately supplying a wrong password
and verifying that the expected "login denied, retry"
user flow occurs.
But typically the effort of writing and maintaining
an integration test pays off best when it tests a single
item in a requirements document.
Think of it as a small slice of an E2E system test.
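A sketch of the wrong-password case, assuming a Flask-style app factory and a /login route (both hypothetical here):

    # Documented failure mode exercised as an integration test
    # (create_app and the /login route are hypothetical).
    from myapp import create_app

    def test_login_with_wrong_password_is_denied():
        client = create_app(testing=True).test_client()
        resp = client.post("/login", data={"user": "alice", "password": "wrong"})
        assert resp.status_code == 401
        assert b"login denied" in resp.data.lower()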
(And in the end you always need a manual tester to occasionally
follow a written sequence of test steps, to verify
that Selenium or other automated test technology
isn't hiding something that prevents users from
accomplishing their goals. For example, a CSS "whoops!"
that gives tiny fonts or low-contrast text could be
enough to render a component unusable, despite the DOM
having the "same" rendered objects it had last month
when users were happy.)
If you want to exhaustively verify that error handlers
function properly, and that all edge case inputs are
dealt with, an integration test is not the way to go.
Prefer narrowly focused unit tests for that.
If your app wasn't written with that in mind, you might need
DI refactors or other (breaking!) changes to the Public API
you designed, so it can accommodate the "one more" requirement
of being readily testable.
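For instance (a sketch with invented names): a hard-wired SMTP connection makes the error handler nearly unreachable from a test, while injecting the mailer makes it trivial to cover.

    # Before: the dependency is hard-wired, so covering the except branch
    # means somehow breaking a real SMTP connection.
    import smtplib

    def notify_admin(msg: str) -> bool:
        try:
            with smtplib.SMTP("localhost") as server:
                server.sendmail("app@example.com", "admin@example.com", msg)
            return True
        except OSError:          # covers SMTPException and connection errors
            return False         # the error handler we'd like to cover

    # After (a breaking API change, renamed here only for side-by-side
    # comparison): the mailer is injected, so a unit test can hand in a
    # fake that always fails.
    def notify_admin_di(msg: str, mailer) -> bool:
        try:
            mailer.send("admin@example.com", msg)
            return True
        except ConnectionError:
            return False

    class AlwaysFailingMailer:
        def send(self, to: str, msg: str) -> None:
            raise ConnectionError("simulated outage")

    def test_notify_admin_reports_failure():
        assert notify_admin_di("disk full", AlwaysFailingMailer()) is False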