Claude Code Voice Mode (support.claude.com)

Brajeshwar 1 day ago

If you already use a universal (OS-wide) voice tool, it should just work in Claude or anywhere else. Even though I’m a smooth and pretty quick typist, I sometimes dictate using Handy[1] and the text gets typed into Claude. Handy is seriously accurate.

It does look like more and more apps/tools are going to come with voice built in: you can either type or talk, go wild.

Ask: Someone commented some time back that they mapped their CAPSLOCK key to push-to-talk. I’ve looked around and could not figure out how to do it without a third-party tool. I’d love to know how to do it in macOS: map the unused CAPSLOCK key to push-to-talk.

1. https://handy.computer

nivertech 1 day ago

Claude Code's /voice mode isn't very accurate compared to other speech-to-text tools (e.g. Google Translate).

Also, it takes about 500ms to activate after you press SPACE, so the first words get cut off. It took me some time to realize it.

Does Handy have better speech-to-text accuracy? Is there an activation delay?

jaen 1 day ago

I've had success in the past in customizing macOS key bindings using Karabiner: https://karabiner-elements.pqrs.org/

Brajeshwar 42 minutes ago

Trying this one. It worked with a simple rule mapping CAPSLOCK to Handy's default push-to-talk shortcut (OPTION + Space):

```
{
    "description": "Caps Lock to Option+Space (Handy push-to-talk)",
    "manipulators": [
        {
            "from": { "key_code": "caps_lock" },
            "to": [
                {
                    "key_code": "spacebar",
                    "modifiers": ["left_option"]
                }
            ],
            "type": "basic"
        }
    ]
}
```
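
For anyone who wants to import the rule as a file instead of pasting it into the Karabiner-Elements UI, here is a rough sketch of the same rule wrapped in a complex-modifications file. It assumes the usual file layout (a top-level "title" plus a "rules" array) and that the file goes under ~/.config/karabiner/assets/complex_modifications/. The title text and whatever file name you pick are placeholders, not anything Handy or Karabiner requires.

```json
{
    "title": "Caps Lock push-to-talk (placeholder title)",
    "rules": [
        {
            "description": "Caps Lock to Option+Space (Handy push-to-talk)",
            "manipulators": [
                {
                    "type": "basic",
                    "from": { "key_code": "caps_lock" },
                    "to": [
                        {
                            "key_code": "spacebar",
                            "modifiers": ["left_option"]
                        }
                    ]
                }
            ]
        }
    ]
}
```

After saving it, the rule should show up under Complex Modifications, where it still has to be enabled. If Handy's shortcut has been changed from OPTION + Space, the "to" entry needs to match the new one.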

ywatanabe1989 8 hours ago

If voice mode isn't limited to interactive conversations: I use my own custom TTS MCP server. https://github.com/ywatanabe1989/scitex-audio

It lets agents deliver audio feedback through the speakers on my desk, which act as a central notifier, with the audio relayed over an SSH tunnel.

If this is off topic here, I apologize for that.

jmathai 1 day ago

Slightly related. Something I love to do is to tell Claude to generate a detailed explanation of a topic I want to learn more about (recently, it was Donor Advised Funds).

If I’m going to pick my kids up then I tell it to make it 10 minutes when read aloud. Then I listen to that on my drive.

emschwartz 1 day ago

This is great. It would be nice to be able to add a custom dictionary for words that are often misunderstood.

aeve890 1 day ago

I'm not seeing anything related to Claude Code. It's the desktop and mobile Claude app with the same feature I've already seen in competing products. Voice isn't even available in the remote control mode of Claude Code.

_aavaa_ 1 day ago

They’re doing a staged rollout. /voice will work in Claude Code if they’ve rolled it out for your account.

furyofantares 1 day ago

Well, it's not mentioned at all in the linked page.

I tried /voice in Claude Code and it lets me hold space to get speech to text into my CLI. Not really what one expects from a "voice mode". And, again, not related to anything in the article.

nivertech 1 day ago

Yes, here voice mode != hands-off mode, so you can't DWC (Driving-While-Coding).

furyofantares 1 day ago

Yeah, it doesn't really add much. I think if you want dictation, your OS very likely has it built in. Certainly wouldn't be worth submitting to HN.