On-device transcription · macOS · Private by default

Voice-first typing
for macOS.

Press fn, talk, release. Your words paste into the active app.

Slack · acme-team
# design
Design reviews, critiques, shipping
Listening
 
Message #design
Audio never leaves your Mac Works offline after first model download
Features

What Ghostly does

Local Whisper

Four model sizes: Small (466 MB), Medium (1.5 GB), Turbo (809 MB), Large-v3 (3.1 GB). GPU-accelerated on Apple Silicon via Metal.

One shortcut, any app

Push-to-talk or toggle. Pastes into whatever has focus — Slack, Xcode, Mail, Figma, Cursor, the URL bar.

Bring-your-own-key cleanup

OpenAI, Anthropic, Groq, OpenRouter, Cerebras, Z.AI, or Apple Intelligence on-device. Only the transcript is sent — never audio.

Screenshot + ask

One shortcut captures the screen, records the question, sends both to a vision model, pastes the answer.

Edit last paste

Say "make it shorter" or "change Monday to Tuesday" and the previous output gets rewritten in place. No re-typing.

Per-app profiles

Different system prompt + vocabulary per active app. Terse for Slack, full sentences for Mail, code-aware for your editor.

Voice commands

Map spoken phrases to keystrokes. "approve" sends , "reject" sends . Useful with Cursor and Claude Code.

"Scratch that"

Say it mid-sentence; everything before drops. Configurable trigger phrase so you can self-correct without re-recording.

Swappable engines

Whisper by default. Swap in Parakeet (Nvidia) for faster inference or Moonshine for English-only streaming. Same UI.

Who it's for

Talking is faster than typing.

Different jobs, same shortcut. Ghostly adapts to the app in focus.

For vibe coders

Dictate intent to Cursor and Claude Code at prompt speed. Map "approve" to and "reject" to so you never touch the keyboard.

For project managers

Reply to Slack, Linear, and Jira without breaking focus. Per-app profiles keep Slack terse and email paragraphs clean — no re-editing.

For executives

Turn hallway thinking into sendable emails. Dictate the gist; AI cleanup shapes it into a paragraph you'd actually send — not a transcript.

For writers

First drafts at speaking pace. Cleanup fixes punctuation and fillers but keeps your voice — not a ChatGPT rewrite unless you ask for one.

For sales

Dump call notes into HubSpot or Salesforce while it's fresh. Between meetings, not at the end of the day when you've forgotten the details.

For creatives

Brainstorm out loud straight into Figma comments, Notion, or your sketch app. Ideas land where you're working without a context-switch to a voice memo app.

Speed

faster than your keyboard.

Most people type around 40 WPM. Most people speak around 150. Ghostly narrows the gap between how fast you think and how fast your tools can keep up.

Keyboard
40 wpm
Speaking with Ghostly
150 wpm
That's a Slack reply in 5 seconds instead of 18. Across a day, that stacks up.
How it compares

Local-native, not cloud-first.

Most AI dictation tools ship your audio to a server and bill you monthly. Ghostly does neither — on-device transcription, choice of engine, and a one-time price.

Scroll horizontally to compare →

Ghostly Wispr Flow Willow Voice Voicy macOS Dictation
Private by default — audio never sent to a server Yes No — cloud only Opt-in mode No — cloud only Yes
Works offline Yes No Opt-in mode No Yes
AI cleanup (punctuation, fillers, formatting) Yes — BYO key Yes — included Yes — included Auto punctuation No
Choice of transcription engine Whisper · Parakeet · Moonshine Proprietary Proprietary Proprietary Apple only
Configurable per-app prompts Yes Auto style Auto format Auto format No
Screenshot + ask a vision model Yes No No No No
Monthly subscription Noneno subscription $15/ month $15/ month $8.49/ month Freebuilt-in
One-time / lifetime $39one-time not offered not offered $220lifetime n/a
Free tier 30 min / wkall features 2k words / wkcapped 2k words / wkcapped 40 min trialone-time Unlimitedbuilt-in
Language support

Speaks your language.

Whisper transcribes English, Spanish, French, German, Italian, Portuguese, Russian, Japanese, Mandarin, Arabic, Hindi — and plenty more — all on-device.

On-device. No cloud round-trip. No per-language setup.

How it works

Hold the shortcut, talk, release.

The transcript pastes into whatever field the cursor is in. Silences get skipped automatically.

fn hold
push to talk
01

Press your shortcut

Any key combo. Push-to-talk or toggle. Voice-activity detection skips silences automatically.

on-device whisper
02

Talk naturally

Whisper transcribes on-device. Say "scratch that" to self-correct. Numbers get converted by AI cleanup.

Ship variant B
pasted into active app
03

Release

Cleaned transcript pastes where your cursor is. No window switch, no copy step, no second thought.

AI cleanup

Bring your own model.

Connect an API key for OpenAI, Anthropic, Groq, OpenRouter, Cerebras, Z.AI, or any OpenAI-compatible endpoint. Ghostly runs the transcript through a model before pasting — fixing punctuation, formatting lists, converting "twenty-five percent" to 25%, stripping fillers.

On Apple Silicon, pick Apple Intelligence for on-device cleanup with no key and no network.

Without a key, nothing breaks — you get the raw Whisper output.

OpenAI Anthropic Groq OpenRouter Cerebras Z.AI Apple Intelligence
Live cleanup
transcribing
filler removed

um so I was thinking we could maybe push the launch to like the twenty fifth and then get marketing involved earlier this time uh yeah.

Cleaned

I was thinking we could push the launch to the 25th and get marketing involved earlier this time.

Per-app tone

Different tones for each app.

Ghostly adapts to whatever's in focus. Same voice input — different punctuation, casing, and tone per destination. Sound like you, not a transcript.

hey
Gmail
Gmail
Slack
Slack
Messages
Messages
Terminal
$ ghostly --toggle-transcription  # toggle recording
$ ghostly --cancel                 # cancel current op
$ ghostly --start-hidden           # launch to tray
$ ghostly --no-tray                # quit on close
$ ghostly --debug                  # verbose logging
CLI

Wire it into your stack.

Trigger Ghostly from a Raycast script, a Stream Deck macro, or a shell alias. Remote-control flags send to the running instance — no duplicate process.

Works anywhere

Any app with a cursor.

Ghostly pastes into whatever's in focus — editors, inboxes, chat, docs, terminals. Wherever your cursor lands.

Active input
Ghostly
Pricing

Generous free. Optional Pro.

Full feature parity on the free tier — every Whisper model, AI cleanup, all of it. Pro uncaps your time and lets you run it across three Macs. One-time purchase, no subscription.

Free

Forever
$0 No card required

Full feature parity, capped at casual-use volume.

  • 30 minutes of transcription per week
  • Install on 1 Mac
  • All Whisper models (Small, Medium, Turbo, Large)
  • AI cleanup (Apple Intelligence, OpenAI, Anthropic)
  • Per-app prompts & screenshot vision
  • 100% private — audio never leaves your Mac
Download for macOS
One-time purchase

Pro

Lifetime access
$39 one-time · no subscription

Uncap the limits. Pay once, keep forever.

  • Unlimited transcription time
  • Install on up to 3 Macs
  • All Whisper models (Small, Medium, Turbo, Large)
  • AI cleanup (Apple Intelligence, OpenAI, Anthropic)
  • Per-app prompts & screenshot vision
  • Free lifetime updates
Get Ghostly Pro — $39

License key delivered instantly by email. Activates every Mac you own.

Download

Install Ghostly

Ghostly is Apple Silicon only — any Mac with an M1 chip or later. Grab the latest .dmg, drag Ghostly to Applications, launch. It'll ask for microphone and accessibility access. Pick a shortcut in Settings. First run downloads a transcription model (500 MB–1.6 GB depending on which one you pick). After that, dictation works offline.

Apple Silicon only
M1 or later · macOS 12+
.dmg →
FAQ

Questions

Does my audio leave my Mac?

No. Transcription runs on-device via Whisper. If you enable optional AI cleanup with a cloud provider, only the resulting text is sent — never audio. Apple Intelligence keeps cleanup on-device too.

Is it free?

Yes. The free tier has full feature parity — all Whisper models, AI cleanup, per-app prompts, screenshot vision — capped at 30 minutes of transcription per week on one Mac. If you hit that cap, Ghostly Pro is a one-time $39 purchase that removes the time cap and lets you install on up to three Macs. No subscription.

What are the system requirements?

macOS 12 or later on Apple Silicon — any Mac with an M1 chip or later. Intel Macs are not supported. Microphone and accessibility permissions required.

Why Apple Silicon only?

Ghostly runs every AI model on-device — Whisper transcription via Metal, and optional Apple Intelligence cleanup via the Neural Engine. Real-time local dictation needs the unified memory and ML accelerators that only Apple Silicon has. Supporting Intel would mean either unusable latency or shipping your audio to a server — a trade-off we won't make.

Does it work offline?

Yes. After the first-run model download, dictation works offline. Optional AI cleanup needs a network connection unless you pick Apple Intelligence.

Windows, Linux, or Intel Mac?

No. Ghostly is Apple Silicon only — M1 or later. Intel Macs, Windows, and Linux are not supported.

Try it.

Local. Yours.