Get the FREE Ultimate OpenClaw Setup Guide →

automating-voice-memos

npx machina-cli add skill SpillwaveSolutions/automating-mac-apps-plugin/automating-voice-memos --openclaw
Files (1)
SKILL.md
4.1 KB

Automating Voice Memos (no dictionary, data+UI hybrid)

Relationship to the macOS automation skill

  • This is a standalone skill for Voice Memos automation.
  • For setup help, see the automating-mac-apps skill for permissions (Full Disk Access, Accessibility) and the ObjC bridge basics.
  • Prerequisites: Basic JXA (JavaScript for Automation) knowledge; install via macOS System Preferences > Security & Privacy.

Core Framing

  • Catalyst App: Voice Memos is an iOS app adapted for macOS without full macOS APIs, hence no AppleScript dictionary for automation.
  • UI-first: Prefer UI scripting/keyboard shortcuts for exports to avoid touching the database/container.
  • Data (optional): Use data-layer control only if needed (ObjC + sqlite3). This requires broader permissions.
  • Permissions: Accessibility for UI automation; Full Disk Access only if you read/write the container/DB.

Workflow (default)

  1. UI-first (no FDA): export via menu/shortcut to a folder you control.
  2. Optional data path: resolve storage paths; query CloudRecordings.db (Apple epoch +978307200) only if required.
  3. For UI actions (recording/export), drive the app with System Events (shortcuts preferred over clicks).
  4. For editing, prefer external tools (ffmpeg) after export; avoid writing directly into the container unless you accept FDA.

Quickstart (UI-only export; no Full Disk Access)

  • Open Voice Memos and select a recording.
  • Export/share UI: the Share sheet can send audio to Notes/other apps (no direct File > Export).
  • Transcript copy (UI-only, no FDA):
    • Run: osascript skills/automating-voice-memos/scripts/copy_transcript_to_file.applescript "/path/to/output.txt"
    • Defaults: Desktop/voice-memo-transcript.txt if no arg.
    • Shows transcript (if available), selects all, copies, and writes to the target file via clipboard.

Permissions

  • Accessibility: System Settings > Privacy & Security > Accessibility > Enable for System Events and your automation app (Terminal/Python/Script Editor).
  • Full Disk Access: Only if you read/write the Voice Memos container/DB directly. UI-only exports do not require FDA.
  • Verify: For UI-only, confirm you can open the Export menu and interact with the save dialog. For data access, verify ${home}/Library/Group Containers/group.com.apple.VoiceMemos.shared/.

Validation Checklist

After implementing Voice Memos automation:

  • Verify Accessibility permissions granted for Terminal/Script Editor
  • Confirm Voice Memos app opens and responds to UI scripting
  • Test transcript copy script executes without errors
  • Validate output file contains expected transcript text
  • For data access: verify Full Disk Access and database path exists

Troubleshooting

  • Permission denied: Verify Full Disk Access and Accessibility permissions.
  • Database locked: Close Voice Memos app before querying.
  • File not found: Check macOS version for correct storage path.
  • UI automation failures: Ensure Voice Memos is focused and Accessibility is enabled.

When Not to Use

  • For cross-platform audio recording (use ffmpeg or platform-agnostic tools)
  • When you need programmatic audio capture (use AVFoundation directly)
  • For iOS Voice Memos automation (no API available)
  • When Full Disk Access cannot be granted

What to load

  • Start with basics & prerequisites: automating-voice-memos/references/voice-memos-basics.md (setup and core concepts).
  • UI automation: automating-voice-memos/references/voice-memos-ui.md (shortcuts, AX scripting).
  • Data access (if needed): automating-voice-memos/references/voice-memos-data.md (storage paths, epochs, database schema).
  • Recipes: automating-voice-memos/references/voice-memos-recipes.md (exports, transcripts, recording control).

Source

git clone https://github.com/SpillwaveSolutions/automating-mac-apps-plugin/blob/main/plugins/automating-mac-apps-plugin/skills/automating-voice-memos/SKILL.mdView on GitHub

Overview

Automating Voice Memos on macOS lets you control the Voice Memos app (Mac Catalyst) with JXA, combining UI scripting and optional data access. It supports exporting recordings, reading from the Voice Memos database, and transcribing memos, all while guiding permissions and safe use of the data layer.

How This Skill Works

The skill uses JavaScript for Automation (JXA) to drive System Events for UI actions and keyboard shortcuts, minimizing reliance on internal APIs. If needed, it can switch to a data path using ObjC + sqlite3 to read the local CloudRecordings.db (Apple epoch offset +978,307,200), which requires Full Disk Access.

When to Use It

  • When you want to automate Voice Memos workflows on macOS
  • When exporting voice recordings via the app's UI or Share sheet
  • When you need to access the Voice Memos database (CloudRecordings.db)
  • When transcribing voice memos with the built-in transcript script
  • When you want to run UI-driven automation with System Events (no FDA for UI-only tasks)

Quick Start

  1. Step 1: Open Voice Memos and select a recording.
  2. Step 2: Use the UI Share/Export to send the audio to a target folder you control.
  3. Step 3: Run: osascript skills/automating-voice-memos/scripts/copy_transcript_to_file.applescript "/path/to/output.txt"

Best Practices

  • Prioritize UI scripting and keyboard shortcuts for exports to avoid touching the database
  • Only use the data path (ObjC + sqlite3) if you truly need direct data access
  • Drive actions with System Events rather than simulating clicks
  • Use the provided transcript script to copy transcripts when UI access is unavailable
  • Verify Accessibility and, when needed, Full Disk Access before data operations

Example Use Cases

  • Export a batch of memos to a controlled folder using the Share/Export UI
  • Copy transcripts to Desktop via the copy_transcript_to_file AppleScript
  • Query CloudRecordings.db to fetch memo metadata for archival
  • Automation that exports memos and re-encodes them with ffmpeg for a standard format
  • Archive exported memos into a project folder for a meeting

Frequently Asked Questions

Add this skill to your agents
Sponsor this space

Reach thousands of developers