This repo lets you control your Windowsยฎ desktop by voice. Using keyboard, mouse, and also voice commands is intuitive, increases your productivity, and is really awesome.
Required is Windows 10 or 11, a headset, and the free Serenade voice recognition application.
- Download and install the free Serenade app from https://serenade.ai. During installation choose no plugin and no programming language, then go to Settings > Server and choose the 'Local' endpoint (good for privacy and speed). Close the application when finished.
- Download and unzip this repo from https://github.com/fleschutz/talk2windows/archive/main.zip.
- Open 'Windows PowerShell (Administrator)' and execute:
Set-ExecutionPolicy RemoteSigned
to allow the execution of PowerShell scripts. - Open 'Windows PowerShell' and execute:
./setup.ps1
in the unzipped talk2windows folder. It exports all voice commands with your personal wake word to Serenade. In the following we will use the wake word Windows.
- Put your headset on and check audio and microphone.
- Launch Serenade and click the slider to switch from Paused to Listening mode. Raise the mic audio level in case a 'low voice' is shown.
- Say: "Windows, open calculator" with a calm voice into the mic. This voice command executes the PowerShell script
scripts/open-calculator.ps1
which launches the calculator app.
Most voice commands follow the generic scheme "<wake word>, <verb> <subject>" such as: "Windows, open calculator". Supported are the following voice commands:
Say hello by replacing <greeting> with: good afternoon
, good evening
, good morning
, good night
, hello
, hey
, hi
, howdy
, morning
, or nice to meet you
.
Install a Microsoft Store application by replacing <app> with: 7-Zip
, CrystalDiskInfo
, CrystalDiskMark
, Discord
, Disney+
, Edge
, Firefox
, Google Chrome
, IrfanView
, Mp3tag
, Netflix
, Opera Browser
, Opera GX
, One Calendar
, Quick Share
, Rufus
, Spotify
, Sticky Notes
, TikTok
, Twitter
, Visual Studio Code
, VLC
, Windows Terminal
, or WhatsApp
. To uninstall it simply say: "Windows, uninstall <app>".
Launch an installed application by replacing <app> with: 3D-Viewer
, 7-Zip
, Calculator
, Character Map
, CrystalDiskInfo
, CrystalDiskMark
, Discord
, Disney+
, Edge
, Firefox
, Git Extensions
, Google Chrome
, Magnifier
, Microsoft Paint
, Microsoft Store
, Mp3tag
, Netflix
, Notepad
, OBS Studio
, One Calendar
, One Note
, Outlook
, Quick Share
, Remote Desktop
, Screen Clip
, Spotify
, Sticky Notes
, System Information
, Thunderbird
, Visual Studio
, VLC
, voice recorder
, Windows Terminal
, or WhatsApp
. To close it simply say: "Windows, close <app>".
Launch the default Web browser (or a new tab) with the given website. Replace <name> with: Amazon
, Apple
, Baidu
, BBC
, Bing
, BitBucket
, BitBucket status
, CDC
, CIA
, CNN
, Discord
, DistroWatch
, Docker
, Docker status
, Dropbox
, eBay
, Facebook
, FBI
, FlightRadar
, Flipboard
, FourSquare
, FRITZ!Box
, FRITZ!Repeater
, GitHub
, GliderTracker
, HolidayCheck
, HRworks
, Instagram
, IPFS
, Microsoft
, NASA
, NBC
, Netflix
, Notepad
, Outdoor Active
, PayPal
, Pinterest
, Pixabay
, Plex
, Serenade
, Slashdot
, Snap Store
, Space Weather
, SpaceX
, Spotify
, Starbucks
, Tesla
, TikTok
, Toggl
, Topo Map
, Twitter
, UFA
, Unsplash
, Walmart
, WhatsApp
, White House
, Windy
, Wikipedia
, Wolfram Alpha
, World News
, Yahoo
, YouTube
, or Zoom
. When finished say: "Windows, close tab".
Let the computer check something by replacing <item> with: audio
, Bitcoin rate
, CPU temperature
, dawn
, day
, DNS
, drives
, dusk
, Earth
(fun), Ether rate
, for crash dumps
, my downloads
, for rain
, for snow
, Git version
, headlines
, internet connection
, internet speed
, moon phase
, month
, New Year
, operating system
, Outlook
, ping latency
, recycle bin
, Santa
, swap space
, temperature
, Tether rate
, time zone
, up-time
, VPN
, weather
, week
, the wind
, year
, or zenith
.
Press a keyboard shortcut to navigate in a browser window. Replace <shortcut> with: close tab
, last tab
, next page
, next tab
, previous page
, previous tab
, open new tab
, open new window
, reload page
, scroll down
, scroll to bottom
, scroll to top
, scroll up
, tab 1
, tab 2
, tab 3
, tab 4
, tab 5
, tab 6
, zoom in
, or zoom out
.
Insert the given character at the current text cursor position. Just replace <char> with: at
, backslash
, comma
, copyright
, Dollar
, Euro
, hashtag
, minus
, number
, percent
, pipe
, plus
, Pound
, slash
, trademark
, underscore
, or Yen
.
Insert text at the current text cursor position. Replace <text> with: bomb
, bye bye
, call?
, checkmark
, clapping
, date
, good-bye
, good morning
, heart
, OK
, shrugging
, smiley
, or thumbs up
.
Launch the default Web browser (or a new tab) showing the given Googleยฎ service. Replace <service> with: Alerts
, Blogger
, Books
, Calendar
, Charts
, Contacts
, Docs
, Drawings
, Earth
, Flights
, Gemini
, Keep
, Mail
, Maps
, Mars
, News
, Notes
, Photos
, Play
, Scholar
, Search
, Sheets
, Shopping list
, Sky
, Slides
, Stadia
, Tasks
, or Translate
. When finished say: "Windows, close tab".
Launches the default Web browser (or a new tab) showing the given magazine website. Replace <name> with: Cosmopolitan
, Focus
, Golf
, GQ
, Health
, National Geographic
, People
, RollingStone
, Scientific American
, Spiegel
, Sports Illustrated
, SZ
, The Oprah
, Time
, Vanity Fair
, Vogue
, or Wired
. When finished say: "Windows, close tab".
Launches the default Web browser (or a new tab) with the given manual. Replace <name> with: Ant
, Apple
, Audacity
, Azure
, Bash
, Blender
, Chrome
, Edge
, Firefox
, GCC
, Git
, Jenkins
, MSBuild
, OBS Studio
, PowerShell
, Serenade
, SystemRescue
, Toyota
, Vim
, Voice
(yes, this page!), Volkswagen
, or Windows
. When finished say: "Windows, close tab".
Launches the default Web browser (or a new tab) showing the given city in Google Maps. Replace <name> with the name of the city. When finished say: "Windows, close tab".
Launches the default Web browser (or a new tab) showing the given exchange rate. Replace <name> with: Bitcoin
, Cardano
, Ether
, Euro
, NFT collections
, Solana
, Tether
, or US dollar
. When finished say: "Windows, close tab".
Launches the default Web browser (or a new tab) and plays a music genre. Replace <genre> with: 70s
, 80s
, 90s
, Alternative Rock
, Blues
, Chillout
, Christmas
, Classical
, Country
, Dance
, Folk
, Hip Hop
, House
, Indie
, Jazz
, Metal
, Pop
,Reggae
, Relax
, RnB
, Rock
, or Workout
. When finished say: "Windows, close tab".
Starts the playback of the given sound, just replace <name> with: alarm
, bee
, beep
, Big Ben
, car
, cat
, cow
, dog
, donkey
, elephant
, elk
, frog
, goat
, gorilla
, horse
, lion
, parrot
, pig
, rattlesnake
, vulture
, or wolf
.
Launches the default Web browser (or a new tab) and lets you play a free browser game. Replace <game> with: 2048
, Agar
, Catan Universe
, Chess
, Cube
, ISS Docking Simulator
, Pacman
, Santa Tracker
, Slither
, Tetris
, TicTacToe
, Tower
, Vanis
, War Brokers
, or Wordle
. When finished say: "Windows, close tab".
Replaces the desktop background by a random photo from Unsplash.com. Replace <category> with: Animal
, Beach
, Car
, Christmas
, City
, Desert
, Fantasy
, Military
, Movie
, Music
, Nature
, Plane
, Random
, Space
, Sports
, or Winter
.
Sets a reminder, just replace <time> with: at # AM
, at # PM
, at midnight
, at noon
, at sunrise
, at sunset
, at tea time
, in # minutes
, or in # hours
.
Launches the File Explorer showing the given drive. Replace <letter> with: C:
, D:
, E:
, F:
, G:
, H:
or M:
. When finished say: "Windows, close file explorer".
Launches the File Explorer with the given folder. Replace <name> with: apps
, applications
, autostart
, crash dumps
, desktop
, documents
, downloads
, Dropbox
, home
, music
, OneDrive
, pictures
, recycle bin
, repos
, SSH
, Talk2Windows
, temporary
, videos
, or Windows
. When finished say: "Windows, close file explorer".
Open the system settings with the given category - just replace <category> with: activation
, apps
, background
, backup
, bluetooth
, color
, date
, default apps
, developer
, display
, ethernet
, lockscreen
, maps
, personal
, printer
, proxy
, recovery
, speech
, start
, system
, taskbar
, themes
, time
, update
, USB
, VPN
, Wifi
, or Windows
. When finished say: "Windows, close system settings".
Shows the given table, just replace <table> with: bluetooth devices
, city weather
, earthquakes
, emojis
, environment variables
, headlines
, installed apps
, installed software
, installed voices
, network adapters
, printers
, running processes
, services
, system info
, tasks
, time zones
, or user groups
.
Performs the given action to the computer, replace <action> with: hibernate
, lock
, reboot
, shut down
, or suspend
.
Asks Windows a question, just replace <question> with: are you a computer?
, are you here?
, are you listening?
, are you ready?
, are you there?
, are you with me?
, can you talk?
, do you listen?
, how are you?
, how do you do?
, how much daylight today?
, how old are you?
, what about ...?
, what is [abbreviation]
, what's the date today?
, what's the temperature?
, what's the time?
, what time is it?
, when is Christmas?
, when is easter?
, when is midnight?
, when is noon?
, when is President's day?
, when is sunrise?
, when is sunset?
, when is tea time?
, where am I?
, where are you?
, where is the ISS
, who is your father?
, or you can talk?
.
Lets Windows execute a command, replace <command> with: calculate [expression]
, connect VPN
, count down from [number]
, empty recycle bin
, enable god mode
, I need help
, locate my phone
, minimize all windows
, ping [host]
, repeat last reply
, roll a dice
, save screenshot
, set volume to [number] percent
, shut up
, sorry
, spell [text]
, tell me a joke
, tell me a quote
, thank you
, thanks
, translate [text] to [language]
, turn volume off
, turn volume on
, turn volume up
, turn volume fully up
, turn volume down
, update repository
, or wakeup [hostname]
.
Say good-bye to your computer by replacing <farewell> with: bye
, bye-bye
, ciao
, good-bye
, I'll be back
, see you
, or see you later
.
Send your email feedback to: markus.fleschutz [at] gmail.com
This open source project is licensed under the CC0 license. All trademarks are the property of their respective owners.