Overview › Testing

Testing

~280 test files covering ViewModels, mappers, repos; UI test coverage moderate.

66 findings in this category

SCORE

Executive summary

Costco Android has a mature, multi-layered test stack: ~170 unit-test files, ~80 instrumentation tests across 30+ modules. The unit-test foundation is JUnit 4 + MockK + Turbine + a custom MainCoroutineRule; UI automation uses Compose UI Test + Espresso + UiAutomator with Karumi Shot for screenshot regression. JaCoCo enforces a 40.81% coverage threshold, which is low for a retail-scale codebase. There are concrete gaps around library version drift (Hilt-testing 2.28-alpha vs Hilt 2.56), absent Macrobenchmark / baseline profiles, and no Firebase Test Lab integration in CI.

1. Tooling inventory

All versions sourced from gradle/libs.versions.toml and the Costco/build.gradle file.

Layer	Tool	Version	Purpose
Unit testing	JUnit 4	4.13.2	Primary test runner across all modules
	JUnit 5 (Jupiter)	6.0.3	Available but not adopted broadly
	MockK	1.14.9	Kotlin-first mocking library
	Mockito + mockito-inline	5.22.0 / 5.2.0	Java mocking; coexists with MockK
	Hamcrest	3.0	Assertion matchers
	kotlin-test	2.0.0-RC1	kotlin.test assertions (limited use)
Coroutines / Flow	kotlinx-coroutines-test	1.10.2	TestDispatcher / runTest
	Turbine	1.2.1	Flow assertion DSL
	androidx.arch.core:core-testing	2.2.0	InstantTaskExecutorRule for LiveData
UI automation	Espresso	3.7.0	XML-View based UI assertions
	Compose UI Test (junit4)	matches Compose 1.10.4	Composable semantics tree assertions
	androidx.test.ext:junit	1.3.0	AndroidJUnitRunner integration
	Test Orchestrator	1.6.1	Process isolation per test
	UiAutomator	2.3.0	Cross-app / system-dialog interaction
	Hilt-testing	2.28-alpha	@HiltAndroidTest entry point
Screenshots	Karumi Shot (plugin + runner)	6.1.0	Reference-image snapshot regression
Coverage	JaCoCo	0.8.8	Line + branch coverage; 40.81% threshold

2. Unit testing — what's in place

Test taxonomy

The ~170 unit-test files break down (approximately) as:

~60 in the Costco app module covering legacy logic (Pharmacy, ShoppingList, FindAStore, locale, webview) — the highest concentration of Java tests.
~40 across feature/* modules — ViewModel, UseCase, Mapper, Util tests. Examples: AccountViewModelTest, ProductDetailPageLandingViewModelTest, NativeSearchUseCaseImplTest, ProductPricingCardModelMapperTest.
~70 across shared/* modules — repository tests, network mappers, formatters. Examples: BffLayerRepositoryImplTest, ContentstackDeliveryRepositoryImplTest, SharedMembershipServiceTest, StringUtilsKtTest, FirebaseRemoteLoggerTest.

Conventions used

Aspect	Convention observed	Notes
File naming	`XxxTest.kt` / `XxxTest.java` (e.g. `AccountViewModelTest.kt`)	Consistent
Method naming	camelCase (e.g. `fetchUserData_returnsSuccess()`)	No backtick/spaced names; Given-When-Then not standardized
Mocking	MockK with `@MockK` + `MockKAnnotations.init(this, relaxUnitFun = true)`	Mockito coexists in legacy Java tests
Assertions	JUnit `Assert.assertEquals`, occasional Hamcrest matchers	Truth/AssertJ not adopted
Coroutines	Custom `MainCoroutineRule` + `UnconfinedTestDispatcher` + `TestScope`	Re-implemented in 7 different modules — see findings below
LiveData	`@get:Rule InstantTaskExecutorRule()`	Standard for ViewModel tests
Flow	Turbine `flow.test { … }`	Used inconsistently — some tests collect manually
Test fixtures	Hand-written fakes (e.g. `FakeDataStore.kt` in `shared/auth`)	Per-module; no shared `:testfixtures` module

Concrete file references

ViewModel test pattern: Costco/src/test/java/com/costco/app/android/ui/pharmacy/PharmacyViewModelTest.kt
Mapper test pattern: Costco/src/test/java/com/costco/app/android/warehouse/data/mapper/WarehouseMapperImplTest.kt
Repository test pattern: feature/productdetaillanding/src/test/java/com/costco/app/productdetaillanding/repo/ProductDetailPageLandingRepositoryImplTest.kt
Hand-written fake: shared/auth/src/test/java/com/costco/app/auth/util/FakeDataStore.kt
Custom coroutine rule (7 copies): feature/discover/src/test/java/com/costco/app/shop/MainCoroutineRule.kt · feature/account/src/test/java/com/costco/app/account/MainCoroutineRule.kt · feature/warehouse/src/test/java/com/costco/app/warehouse/MainCoroutineRule.kt · 4 others

3. Automation / UI testing — what's in place

Test types running on device / emulator

Test type	Framework	Where used
Compose component test	Compose UI Test (`createComposeRule()`)	`shared/sdui` (~40 tests), `shared/topbar`, `shared/navigationheader`
Espresso XML View test	Espresso `onView` / `onData`	Legacy screens in `Costco/src/androidTest`
Page Object Model E2E	Espresso + UiAutomator + Hilt	`Costco/src/androidTest/.../costcoUITests/pages/`
Hilt instrumentation test	`@HiltAndroidTest` + `HiltAndroidRule`	5+ modules; `shared/topbar/.../DefaultNavHeaderTest.kt`
Screenshot regression	Karumi Shot	`feature/account/.../MembershipCardComponentScreenShotTest.kt` + others under `*ScreenShotTest.kt`

Page Object Model

The androidTest tree under the main app module follows an organized POM-style structure with one file per surface:

Costco/src/androidTest/kotlin/com/costco/app/android/data/source/local/costcoUITests/pages/HomePageTest.kt
Costco/src/androidTest/kotlin/com/costco/app/android/data/source/local/costcoUITests/pages/ShopPageTest.kt
Costco/src/androidTest/kotlin/com/costco/app/android/data/source/local/costcoUITests/pages/SavingsPageTest.kt
Costco/src/androidTest/kotlin/com/costco/app/android/data/source/local/costcoUITests/pages/SavingsOffersTest.kt
Costco/src/androidTest/kotlin/com/costco/app/android/data/source/local/costcoUITests/pages/OnBoardingTest.kt
Centralized test data: .../costcoUITests/TestConstant.kt

UiAutomator falls in for system dialogs (e.g. permission prompts) — see UiDevice.getInstance() usage in HomePageTest.kt.

Compose UI Test pattern

Component-level tests use createComposeRule() + composeTestRule.setContent { … } to mount the Composable in isolation, then assert on semantics nodes:

@RunWith(AndroidJUnit4::class)
@HiltAndroidTest
class DefaultNavHeaderTest {
  @get:Rule(order = 0) val hiltRule = HiltAndroidRule(this)
  @get:Rule(order = 1) val composeTestRule = createComposeRule()

  @Test
  fun renders_brand_logo() {
    composeTestRule.setContent { CostcoTheme { DefaultNavHeader(...) } }
    composeTestRule.onNodeWithTag("brand_logo").assertIsDisplayed()
  }
}

Test runner configuration

The app module declares two runners depending on task:

Default: androidx.test.runner.AndroidJUnitRunner
For screenshot tests: com.karumi.shot.ShotTestRunner
Test orchestration: execution 'ANDROIDX_TEST_ORCHESTRATOR' — each test runs in a separate process to prevent state leakage.

Files: Costco/build.gradle (lines 69–80)

4. Coverage

JaCoCo is wired in via jacoco.gradle at the repo root (toolVersion 0.8.8), with a coverageCheck task that fails the build if line coverage drops below 40.81%.

Aspect	Detail
Coverage tool	JaCoCo 0.8.8
Threshold	40.81% (line)
Reports	HTML + XML, per build variant
Excluded from coverage	Activities, Fragments, Compose components, generated DI/Serialization/R/BuildConfig
Sonar / SonarCloud	Not detected in CI
PR-level coverage diff	Not detected

5. CI integration

Test execution runs on Azure Pipelines. The pipeline file is azure-pipelines.yml (~50 KB). Highlights:

PR triggers on develop, release-*, features/*.
Two-stage flow: Classify (PR metadata) → PR build (gradle build + tests).
Test results published via testResultsFiles: '**/TEST-*.xml'.
Gradle parallelism enabled (--max-workers).
Separate pipeline for security: azure-pipelines-nowsecure.yml runs NowSecure mobile security scans.
BrowserStack integration is referenced for web testing — no Firebase Test Lab.

6. Findings

PASS

Modern Kotlin-first unit-test stack

MockK + Turbine + Coroutines test + InstantTaskExecutorRule reflects current best practice. Repository / Mapper / ViewModel layers are well-tested.

PASS

Page Object Model adopted

E2E tests under costcoUITests/pages/ with TestConstant.kt for data — a maintainable pattern, especially with Hilt and Test Orchestrator.

PASS

Screenshot regression with Karumi Shot

Reference-image testing for design-system surfaces — catches visual regressions early. Files match *ScreenShotTest.kt.

HIGH

Hilt-testing version drift (2.28-alpha vs Hilt 2.56)

Production code uses Hilt 2.56 while hilt_testing is pinned at 2.28-alpha. Mismatched versions risk subtle bugs in test-time DI — generated factories may not align with runtime ones, and bug fixes from 2.28 → 2.56 are missed.

Recommendation: Bump hilt-android-testing to match hilt-android (2.56). Run all @HiltAndroidTest suites after the bump.

HIGH

Coverage threshold is too low (40.81%)

For a retail-scale Android app, a 40% threshold lets large untested surface areas slip through. Combined with the broad exclusion list (Activities, Fragments, Compose), real branch coverage is likely lower.

Recommendation: Stair-step the threshold quarterly toward 60–70% line coverage. Publish per-PR coverage diffs (Codecov or SonarCloud) so reviewers see what changed code is uncovered.

MEDIUM

`MainCoroutineRule` duplicated across 7 modules

The same TestWatcher-based coroutine rule is re-implemented in feature/discover, feature/account, feature/warehouse, and 4 others. Drift is inevitable; one module already uses StandardTestDispatcher while another uses UnconfinedTestDispatcher.

Recommendation: Create a shared/testfixtures module exposing MainCoroutineRule, FakeXxx repos, and TestData. Have feature modules testImplementation project(":shared:testfixtures").

MEDIUM

JUnit 4 + JUnit 5 both available

libs.versions.toml declares both junit 4.13.2 and Jupiter 6.0.3. JUnit 5 lifecycle (@BeforeEach vs @Before) and rule semantics differ; mixing causes confusion.

Recommendation: Pick one (JUnit 4 since most existing tests use it). Remove Jupiter from the catalog or document a clear migration plan.

MEDIUM

Mockito + MockK both present

Both are wired up; MockK is used for new Kotlin tests, Mockito persists in legacy Java tests. Cross-cutting helpers may target one library only, fragmenting test utilities.

Recommendation: Standardize on MockK for Kotlin. Keep Mockito only as long as legacy Java tests live; add a Detekt rule to forbid new Mockito imports in .kt files.

MEDIUM

Heavy reliance on mocks over fakes

Repositories and remote sources are typically mocked rather than substituted with hand-written fakes. Mocks couple tests to implementation; fakes describe behavior and survive refactors better.

Recommendation: Build a small library of fakes (FakeAccountRepository, FakeBffService) in the proposed :shared:testfixtures module. Reserve MockK for verifying interactions, not stubbing data.

MEDIUM

No Macrobenchmark / baseline profile

No :macrobenchmark module and no baseline-profile task. Cold-start, scrolling, and frame-drop regressions go unmeasured release-over-release.

Recommendation: Add a Macrobenchmark module that exercises Home → PDP → Cart. Generate a baseline profile and ship it in the APK to optimize cold-start.

MEDIUM

No Firebase Test Lab in CI

UI tests run on the local Gradle-managed device only. No matrix run across OEM device variants — Samsung One UI / Xiaomi MIUI / Pixel — where layout / WebView / camera bugs commonly surface.

Recommendation: Wire Firebase Test Lab (or BrowserStack App Live) into the nightly build with a representative 3–5 device matrix. Gate releases on these results.

LOW

Karumi Shot vs newer alternatives

Karumi Shot 6.1.0 still works but its successor ecosystem (Paparazzi, Roborazzi) runs without an emulator and is faster on CI. Roborazzi additionally integrates with Compose preview annotations.

Recommendation: Pilot Roborazzi on one feature module; evaluate runtime + flake rate vs Shot. If favorable, migrate over a quarter.

LOW

No Test Retry plugin in Gradle

UI / instrumentation tests can flake (animation, network). Without retry tracking, flakes are silent.

Recommendation: Apply org.gradle.test-retry with a hard ceiling (1 retry); track flake rate in a CI dashboard.

LOW

Test naming inconsistency

Mix of camelCase test method names; no enforced Given-When-Then or backtick-spaced names. Reviewers spend time inferring intent.

Recommendation: Adopt backtick-spaced naming (e.g. fun \`returns Loading then Success when fetch succeeds\`()) and document in a CONTRIBUTING.md test section.

INFO

Mutation testing absent

No Pitest / Stryker mutation testing. Coverage % alone says nothing about assertion quality.

Recommendation: Add Pitest on a single module as a pilot; use mutation score as a complementary quality signal.

7. Recommended target state

Capability	Today	Target (12 months)
Unit-test coverage	40.81% threshold	65–70% line coverage; PR-diff blocking on changed files
Coroutine rule	Duplicated across 7 modules	Single source in `:shared:testfixtures`
Mock vs Fake split	Mock-heavy	Fakes for repos/services; MockK for verifying interactions
Hilt-testing version	2.28-alpha	2.56 matching production
Screenshot tests	Karumi Shot	Roborazzi (or Paparazzi) for Compose; off-emulator
Performance testing	None	Macrobenchmark + baseline profile shipped in APK
Device matrix	Single GMD	Firebase Test Lab nightly across 3–5 devices
Mutation testing	None	Pitest on critical modules (productdetaillanding, account)
Flake handling	Manual	Gradle test-retry + flake dashboard
Test reporting	JUnit XML in Azure	Azure + Codecov/SonarCloud + PR diff annotations

Specific findings in this category

Costco Android · Code Review Report · Generated 2026-05-07 · 626 machine-curated findings

Testing

Executive summary

1. Tooling inventory

2. Unit testing — what's in place

Test taxonomy

Conventions used

Concrete file references

3. Automation / UI testing — what's in place

Test types running on device / emulator

Page Object Model

Compose UI Test pattern

Test runner configuration

4. Coverage

5. CI integration

6. Findings

Modern Kotlin-first unit-test stack

Page Object Model adopted

Screenshot regression with Karumi Shot

Hilt-testing version drift (2.28-alpha vs Hilt 2.56)

Coverage threshold is too low (40.81%)

MainCoroutineRule duplicated across 7 modules

JUnit 4 + JUnit 5 both available

Mockito + MockK both present

Heavy reliance on mocks over fakes

No Macrobenchmark / baseline profile

No Firebase Test Lab in CI

Karumi Shot vs newer alternatives

No Test Retry plugin in Gradle

Test naming inconsistency

Mutation testing absent

7. Recommended target state

Specific findings in this category

`MainCoroutineRule` duplicated across 7 modules