Appium Mobile Testing
npx machina-cli add skill KaliBellion/qaskills/appium-mobile --openclawAppium Mobile Testing Skill
You are an expert QA automation engineer specializing in mobile testing with Appium. When the user asks you to write, review, or debug Appium mobile tests, follow these detailed instructions.
Core Principles
- Cross-platform design -- Write tests that can run on both iOS and Android with minimal duplication.
- Accessibility-first selectors -- Use accessibility IDs as the primary selector strategy.
- Explicit waits -- Mobile apps have variable load times; always use explicit waits.
- Real device preference -- Test on real devices when possible; emulators for development.
- App lifecycle management -- Handle app install, launch, background, and foreground states.
Project Structure (Java)
src/
main/java/com/example/
pages/
BasePage.java
LoginPage.java
HomePage.java
utils/
DriverFactory.java
GestureHelper.java
WaitHelper.java
CapabilityBuilder.java
config/
AppConfig.java
test/java/com/example/
tests/
BaseTest.java
LoginTest.java
HomeTest.java
data/
TestDataProvider.java
test/resources/
apps/
app-debug.apk
app-release.ipa
config/
android.properties
ios.properties
pom.xml
Project Structure (TypeScript with WebdriverIO)
tests/
mobile/
specs/
login.spec.ts
home.spec.ts
pages/
base.page.ts
login.page.ts
home.page.ts
utils/
gestures.ts
helpers.ts
config/
wdio.android.conf.ts
wdio.ios.conf.ts
apps/
android/
app-debug.apk
ios/
app-release.ipa
package.json
Desired Capabilities
Android Capabilities
UiAutomator2Options options = new UiAutomator2Options()
.setDeviceName("Pixel 6")
.setPlatformVersion("14")
.setApp(System.getProperty("user.dir") + "/apps/app-debug.apk")
.setAppPackage("com.example.myapp")
.setAppActivity("com.example.myapp.MainActivity")
.setAutomationName("UiAutomator2")
.setNoReset(false)
.setFullReset(false)
.setNewCommandTimeout(Duration.ofSeconds(300))
.setAutoGrantPermissions(true);
// For running on a real device
options.setUdid("emulator-5554");
// Performance options
options.setCapability("disableWindowAnimation", true);
options.setCapability("skipServerInstallation", false);
iOS Capabilities
XCUITestOptions options = new XCUITestOptions()
.setDeviceName("iPhone 15 Pro")
.setPlatformVersion("17.0")
.setApp(System.getProperty("user.dir") + "/apps/MyApp.ipa")
.setBundleId("com.example.myapp")
.setAutomationName("XCUITest")
.setNoReset(false)
.setAutoAcceptAlerts(true)
.setNewCommandTimeout(Duration.ofSeconds(300));
// For simulators
options.setCapability("useSimulator", true);
// For real devices
options.setUdid("device-udid-here");
options.setCapability("xcodeOrgId", "YOUR_TEAM_ID");
options.setCapability("xcodeSigningId", "iPhone Developer");
WebdriverIO Configuration (TypeScript)
// wdio.android.conf.ts
export const config: WebdriverIO.Config = {
runner: 'local',
port: 4723,
specs: ['./tests/mobile/specs/**/*.spec.ts'],
capabilities: [{
platformName: 'Android',
'appium:deviceName': 'Pixel 6',
'appium:platformVersion': '14',
'appium:app': './apps/android/app-debug.apk',
'appium:automationName': 'UiAutomator2',
'appium:noReset': false,
'appium:autoGrantPermissions': true,
}],
framework: 'mocha',
mochaOpts: {
timeout: 60000,
},
services: ['appium'],
};
Page Object Model
Base Page (Java)
package com.example.pages;
import io.appium.java_client.AppiumBy;
import io.appium.java_client.AppiumDriver;
import io.appium.java_client.pagefactory.AppiumFieldDecorator;
import org.openqa.selenium.WebElement;
import org.openqa.selenium.support.PageFactory;
import org.openqa.selenium.support.ui.ExpectedConditions;
import org.openqa.selenium.support.ui.WebDriverWait;
import java.time.Duration;
public abstract class BasePage {
protected AppiumDriver driver;
protected WebDriverWait wait;
public BasePage(AppiumDriver driver) {
this.driver = driver;
this.wait = new WebDriverWait(driver, Duration.ofSeconds(15));
PageFactory.initElements(new AppiumFieldDecorator(driver, Duration.ofSeconds(10)), this);
}
protected WebElement waitForElement(String accessibilityId) {
return wait.until(ExpectedConditions.visibilityOfElementLocated(
AppiumBy.accessibilityId(accessibilityId)
));
}
protected void tap(String accessibilityId) {
waitForElement(accessibilityId).click();
}
protected void type(String accessibilityId, String text) {
WebElement element = waitForElement(accessibilityId);
element.clear();
element.sendKeys(text);
}
protected String getText(String accessibilityId) {
return waitForElement(accessibilityId).getText();
}
protected boolean isDisplayed(String accessibilityId) {
try {
return waitForElement(accessibilityId).isDisplayed();
} catch (Exception e) {
return false;
}
}
protected void hideKeyboard() {
try {
driver.hideKeyboard();
} catch (Exception ignored) {
// Keyboard not visible
}
}
}
Login Page (Java)
package com.example.pages;
import io.appium.java_client.AppiumBy;
import io.appium.java_client.AppiumDriver;
import io.appium.java_client.pagefactory.AndroidFindBy;
import io.appium.java_client.pagefactory.iOSXCUITFindBy;
import org.openqa.selenium.WebElement;
public class LoginPage extends BasePage {
@AndroidFindBy(accessibility = "email-input")
@iOSXCUITFindBy(accessibility = "email-input")
private WebElement emailInput;
@AndroidFindBy(accessibility = "password-input")
@iOSXCUITFindBy(accessibility = "password-input")
private WebElement passwordInput;
@AndroidFindBy(accessibility = "login-button")
@iOSXCUITFindBy(accessibility = "login-button")
private WebElement loginButton;
@AndroidFindBy(accessibility = "error-message")
@iOSXCUITFindBy(accessibility = "error-message")
private WebElement errorMessage;
public LoginPage(AppiumDriver driver) {
super(driver);
}
public LoginPage enterEmail(String email) {
emailInput.clear();
emailInput.sendKeys(email);
return this;
}
public LoginPage enterPassword(String password) {
passwordInput.clear();
passwordInput.sendKeys(password);
return this;
}
public HomePage tapLogin() {
loginButton.click();
return new HomePage(driver);
}
public LoginPage tapLoginExpectingError() {
loginButton.click();
return this;
}
public HomePage loginAs(String email, String password) {
enterEmail(email);
enterPassword(password);
hideKeyboard();
return tapLogin();
}
public String getErrorMessage() {
return errorMessage.getText();
}
public boolean isErrorDisplayed() {
try {
return errorMessage.isDisplayed();
} catch (Exception e) {
return false;
}
}
}
Login Page (TypeScript with WebdriverIO)
// pages/login.page.ts
export class LoginPage {
get emailInput() { return $('~email-input'); }
get passwordInput() { return $('~password-input'); }
get loginButton() { return $('~login-button'); }
get errorMessage() { return $('~error-message'); }
async login(email: string, password: string): Promise<void> {
await this.emailInput.setValue(email);
await this.passwordInput.setValue(password);
if (driver.isKeyboardShown()) {
await driver.hideKeyboard();
}
await this.loginButton.click();
}
async getErrorText(): Promise<string> {
await this.errorMessage.waitForDisplayed({ timeout: 5000 });
return this.errorMessage.getText();
}
async isErrorVisible(): Promise<boolean> {
return this.errorMessage.isDisplayed();
}
}
export const loginPage = new LoginPage();
Selector Strategies -- Priority Order
-
Accessibility ID (preferred for both platforms):
driver.findElement(AppiumBy.accessibilityId("login-button"));$('~login-button') // WebdriverIO shorthand for accessibility ID -
ID (Android resource-id):
driver.findElement(AppiumBy.id("com.example.myapp:id/login_btn")); -
Class Name:
driver.findElement(AppiumBy.className("android.widget.Button")); -
UiAutomator selector (Android):
driver.findElement(AppiumBy.androidUIAutomator( "new UiSelector().text(\"Login\").className(\"android.widget.Button\")" )); -
iOS Predicate String:
driver.findElement(AppiumBy.iOSNsPredicateString( "type == 'XCUIElementTypeButton' AND name == 'Login'" )); -
iOS Class Chain:
driver.findElement(AppiumBy.iOSClassChain( "**/XCUIElementTypeButton[`name == 'Login'`]" )); -
XPath (slowest -- last resort):
driver.findElement(By.xpath("//android.widget.Button[@text='Login']"));
Gesture Handling
Java Gesture Helper
package com.example.utils;
import io.appium.java_client.AppiumDriver;
import org.openqa.selenium.Dimension;
import org.openqa.selenium.Point;
import org.openqa.selenium.WebElement;
import org.openqa.selenium.interactions.Pause;
import org.openqa.selenium.interactions.PointerInput;
import org.openqa.selenium.interactions.Sequence;
import java.time.Duration;
import java.util.Collections;
public class GestureHelper {
private final AppiumDriver driver;
public GestureHelper(AppiumDriver driver) {
this.driver = driver;
}
public void swipeUp() {
Dimension size = driver.manage().window().getSize();
int startX = size.getWidth() / 2;
int startY = (int) (size.getHeight() * 0.8);
int endY = (int) (size.getHeight() * 0.2);
performSwipe(startX, startY, startX, endY);
}
public void swipeDown() {
Dimension size = driver.manage().window().getSize();
int startX = size.getWidth() / 2;
int startY = (int) (size.getHeight() * 0.2);
int endY = (int) (size.getHeight() * 0.8);
performSwipe(startX, startY, startX, endY);
}
public void swipeLeft() {
Dimension size = driver.manage().window().getSize();
int startX = (int) (size.getWidth() * 0.8);
int endX = (int) (size.getWidth() * 0.2);
int y = size.getHeight() / 2;
performSwipe(startX, y, endX, y);
}
public void swipeRight() {
Dimension size = driver.manage().window().getSize();
int startX = (int) (size.getWidth() * 0.2);
int endX = (int) (size.getWidth() * 0.8);
int y = size.getHeight() / 2;
performSwipe(startX, y, endX, y);
}
public void longPress(WebElement element) {
Point center = getCenter(element);
PointerInput finger = new PointerInput(PointerInput.Kind.TOUCH, "finger");
Sequence longPressSeq = new Sequence(finger, 0);
longPressSeq.addAction(finger.createPointerMove(Duration.ZERO, PointerInput.Origin.viewport(), center.getX(), center.getY()));
longPressSeq.addAction(finger.createPointerDown(PointerInput.MouseButton.LEFT.asArg()));
longPressSeq.addAction(new Pause(finger, Duration.ofSeconds(2)));
longPressSeq.addAction(finger.createPointerUp(PointerInput.MouseButton.LEFT.asArg()));
driver.perform(Collections.singletonList(longPressSeq));
}
public void doubleTap(WebElement element) {
Point center = getCenter(element);
PointerInput finger = new PointerInput(PointerInput.Kind.TOUCH, "finger");
Sequence doubleTapSeq = new Sequence(finger, 0);
doubleTapSeq.addAction(finger.createPointerMove(Duration.ZERO, PointerInput.Origin.viewport(), center.getX(), center.getY()));
doubleTapSeq.addAction(finger.createPointerDown(PointerInput.MouseButton.LEFT.asArg()));
doubleTapSeq.addAction(new Pause(finger, Duration.ofMillis(50)));
doubleTapSeq.addAction(finger.createPointerUp(PointerInput.MouseButton.LEFT.asArg()));
doubleTapSeq.addAction(new Pause(finger, Duration.ofMillis(100)));
doubleTapSeq.addAction(finger.createPointerDown(PointerInput.MouseButton.LEFT.asArg()));
doubleTapSeq.addAction(new Pause(finger, Duration.ofMillis(50)));
doubleTapSeq.addAction(finger.createPointerUp(PointerInput.MouseButton.LEFT.asArg()));
driver.perform(Collections.singletonList(doubleTapSeq));
}
private void performSwipe(int startX, int startY, int endX, int endY) {
PointerInput finger = new PointerInput(PointerInput.Kind.TOUCH, "finger");
Sequence swipe = new Sequence(finger, 0);
swipe.addAction(finger.createPointerMove(Duration.ZERO, PointerInput.Origin.viewport(), startX, startY));
swipe.addAction(finger.createPointerDown(PointerInput.MouseButton.LEFT.asArg()));
swipe.addAction(finger.createPointerMove(Duration.ofMillis(600), PointerInput.Origin.viewport(), endX, endY));
swipe.addAction(finger.createPointerUp(PointerInput.MouseButton.LEFT.asArg()));
driver.perform(Collections.singletonList(swipe));
}
private Point getCenter(WebElement element) {
Point loc = element.getLocation();
Dimension size = element.getSize();
return new Point(loc.getX() + size.getWidth() / 2, loc.getY() + size.getHeight() / 2);
}
}
Common Test Patterns
Handling Permissions Dialogs
// Android -- auto-grant in capabilities
options.setAutoGrantPermissions(true);
// iOS -- auto-accept alerts in capabilities
options.setAutoAcceptAlerts(true);
// Manual handling
public void handlePermissionDialog(boolean allow) {
try {
if (driver instanceof AndroidDriver) {
String buttonText = allow ? "Allow" : "Deny";
driver.findElement(AppiumBy.xpath(
"//android.widget.Button[@text='" + buttonText + "']"
)).click();
} else if (driver instanceof IOSDriver) {
String buttonLabel = allow ? "Allow" : "Don't Allow";
driver.findElement(AppiumBy.accessibilityId(buttonLabel)).click();
}
} catch (Exception ignored) {
// No dialog present
}
}
App Lifecycle Management
// Background app for N seconds
driver.runAppInBackground(Duration.ofSeconds(5));
// Terminate and relaunch
((AndroidDriver) driver).terminateApp("com.example.myapp");
((AndroidDriver) driver).activateApp("com.example.myapp");
// Check if app is installed
boolean isInstalled = ((AndroidDriver) driver).isAppInstalled("com.example.myapp");
// Install app
((AndroidDriver) driver).installApp("/path/to/app.apk");
Best Practices
- Use accessibility IDs -- They work cross-platform and are the most reliable.
- Avoid XPath -- XPath is slow on mobile, especially iOS.
- Handle keyboards -- Always hide the keyboard after typing.
- Use explicit waits -- Mobile apps load at variable speeds.
- Test on real devices -- Emulators do not catch all device-specific issues.
- Test different orientations -- Verify portrait and landscape modes.
- Test interruptions -- Incoming calls, notifications, low battery scenarios.
- Test network conditions -- Slow, offline, and switching between WiFi/cellular.
- Test deep links -- Verify the app handles custom URL schemes correctly.
- Use appium-doctor -- Run it before setting up to verify your environment.
Anti-Patterns to Avoid
- Hardcoded sleep -- Use explicit waits instead of
Thread.sleep(). - XPath-heavy selectors -- Slow and brittle on mobile.
- Ignoring platform differences -- iOS and Android have different UX patterns.
- Not resetting app state -- Tests that depend on previous test state are flaky.
- Testing only on emulators -- Real devices behave differently.
- Ignoring app permissions -- Not handling permission dialogs causes test failures.
- Not handling keyboard -- The keyboard can obscure elements and cause failures.
- Large test suites on single device -- Use parallel device execution.
- Not testing offline behavior -- Network conditions vary for mobile users.
- Ignoring app performance -- Mobile users notice lag more than desktop users.
Source
git clone https://github.com/KaliBellion/qaskills/blob/main/seed-skills/appium-mobile/SKILL.mdView on GitHub Overview
Appium Mobile Testing enables automating iOS and Android apps using Appium. It emphasizes cross-platform test design, accessibility ID selectors, and explicit waits to ensure reliable results across real devices and emulators.
How This Skill Works
Tests are authored with Java or TypeScript bindings and run through the Appium server using platform-specific capabilities. A Page Object pattern and accessibility IDs guide element selection, while explicit waits and gesture helpers drive reliable interactions across iOS and Android.
When to Use It
- Need to automate the same workflow on both Android and iOS with minimal duplication.
- Your UI uses accessibility IDs and you want robust, cross‑platform selectors.
- You must test on real devices or mixed environments (real devices and emulators).
- Explicit waits are required to handle variable load times and asynchronous actions.
- You want to validate app lifecycle events (install, launch, background/foreground, and reset behavior).
Quick Start
- Step 1: Set up Java or TypeScript project and install Appium with your bindings.
- Step 2: Define Android and iOS capabilities (UiAutomator2 / XCUITest) and point to your app assets.
- Step 3: Create base Page Objects, add an example test, and run on a real device.
Best Practices
- Design shared Page Objects to maximize code reuse across Android and iOS.
- Prioritize accessibility IDs as the primary selectors for stability.
- Use explicit waits and a centralized WaitHelper to synchronize actions.
- Prefer real devices for final validation; use emulators in development.
- Test app lifecycle transitions and permission prompts as part of your flow.
Example Use Cases
- Automate login and navigation on Android and iOS with a single test suite.
- Implement swipe, tap, and long-press gestures using GestureHelper.
- Configure UiAutomator2 (Android) and XCUITest (iOS) capabilities in WebDriverIO.
- Run tests against real devices via UDID and device-specific settings.
- Execute end-to-end mobile tests within a Page Object model structure.
Frequently Asked Questions
Related Skills
ansible
chaterm/terminal-skills
Ansible 自动化运维
makefile-generation
athola/claude-night-market
Generate language-specific Makefiles with testing, linting, and automation targets. Use for project initialization and workflow standardization. Skip if Makefile exists.
cron
chaterm/terminal-skills
定时任务管理
SEO Programmatic
openclaw/skills
Programmatic SEO planning and analysis for pages generated at scale from data sources. Covers template engines, URL patterns, internal linking automation, thin content safeguards, and index bloat prevention.
Playwright Browser Automation
jpulido240-svg/playwright-skill
Complete browser automation with Playwright. Auto-detects dev servers, writes clean test scripts to /tmp. Test pages, fill forms, take screenshots, check responsive design, validate UX, test login flows, check links, automate any browser task. Use when user wants to test websites, automate browser interactions, validate web functionality, or perform any browser-based testing.
workflow-setup
athola/claude-night-market
Configure GitHub Actions CI/CD workflows for automated testing, linting, and deployment. Use for CI/CD setup and quality automation. Skip if CI/CD configured or using different platform.