GitHub - tdoris/xvoice: Speech interface controller for linux, allows user to control applications and dictate

tdoris / xvoice Public

Notifications You must be signed in to change notification settings
Fork 0
Star 11

Speech interface controller for linux, allows user to control applications and dictate

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
modules		modules
src		src
.cvsignore		.cvsignore
AUTHORS		AUTHORS
CODEGUIDE		CODEGUIDE
COMMANDS		COMMANDS
COPYING		COPYING
ChangeLog		ChangeLog
INSTALL		INSTALL
Makefile.am		Makefile.am
Makefile.in		Makefile.in
NEWS		NEWS
README		README
TODO		TODO
acconfig.h		acconfig.h
aclocal.m4		aclocal.m4
config.guess		config.guess
config.h.in		config.h.in
config.sub		config.sub
configure		configure
configure.in		configure.in
depcomp		depcomp
install-sh		install-sh
libtool		libtool
ltconfig		ltconfig
ltmain.sh		ltmain.sh
missing		missing
mkinstalldirs		mkinstalldirs
stamp-h.in		stamp-h.in
vimcmds		vimcmds
xvoice.png		xvoice.png
xvoice.spec.in		xvoice.spec.in
xvoice.xml		xvoice.xml

Repository files navigation

XVoice

Version 0.8, Thu Mar  9 14:17:37 PST 2000
by Brian Craft, while Tom is tied up on other things.
http://www.thecraftstudio.com/bcboy/xvoice
bcboy@thecraftstudio.com

Version 0.6, Nov 25, 1999.
http://www.compapp.dcu.ie/~tdoris/Xvoice
Tom Doris
tdoris@compapp.dcu.ie

Versions previous to 0.5 by:
http://www.zachary.com/creemer/xvoice.html
David Z. Creemer
david@zachary.com

XVoice accepts contuinuous speech input from IBM's ViaVoice runtime for Linux,
and then retarget the resulting text at many X applications. It uses the
XTest extension to accomplish this.

Speech input can be commands, such as "delete three lines", or dictation, such
as "You sent the attachment in Microsoft Word format, a secret proprietary
format, so I cannot read it." XVoice will map commands to key and mouse events,
as specified in the configuration file. It will map dictation to equivalent key
events, as if you had typed it. Command mode and dictation mode can be enabled
independently.

The ViaVoice engine requires some environment variables to be set. XVoice will
attempt to bootstrap the environment by invoking vvsetenv (shipped with the
ViaVoice run-time kit) and restarting. If this fails you need to set up your
environment by executing the following in your (bourne syntax) shell:

shell% . vvsetenv

XXX -- Is the following true? or just an XSendEvent relic?

In order to get emacs to work, you _must_ evaluate the following
elisp:

(setq x-allow-sendevents t)

BUILDING

XVoice requires X, ViaVoice SDK, GTK+, and libgnomeui to build. Some of the
code is based on the IBM ViaVoice SDK sample "gtkhello". If you can build that
sample program, you can build XVoice. See the Makefile for local settings.

RUNNING

"xvoice -h" will show you help on running the application.

"xvoice -m" starts XVoice with the microphone ON.

XVoice

xvoice actively tracks the focus window and loads whatever grammars it finds
for that window. It also keeps a history of any context grammars that were
loaded, so if the focus changes the context grammars can be automatically
reloaded when the window is re-focused. "command mode" and "dictate mode" may
be used to control which vocabularies are loaded.

Once started, XVoice presents a list of currently active vocabularies,
a list of recognised words (initially blank), and the current target
application (initially blank). Focus a target with the mouse, or by
saying "next window" until the desired application is focused, and say
"command mode" or "dictate mode" to begin sending voice commands.
Say "stop command" or "stop dictate" to stop each mode.

"Microphone off" will turn off the microphone.

While dictating into an application, speaking "stop dictation" will
return to XVoice command mode. Speaking "correction" will erase the
last dictated word or punctuation. You may erase words back to the
beginning of the dictation session. Speaking "new paragraph" will
enter a single CR for you (i.e. "enter").

Speaking "command mode" will put Xvoice into application command mode.  Xvoice
will attempt to find the grammar file for the focused window, and use this as
the current grammar. The xvoice.xml file contains mappings from spoken text to
synthesisted X events. Thus it is possible to speak "open file" and have the
key sequence "C-x C-f" sent to emacs. It is possible for users to modify the
xvoice.xml file to incorporate key sequences for other applications.  This
procedure is described in the COMMANDS file.

COPYRIGHT

Long Version:

XVoice is Copyright (c) 1999, 2000, 2001, 2002 by David Z. Creemer, Tom Doris,
Brian Craft, Jessica Heckman, Deborah Kaplan.

All rights not granted in the LICENSE file are reserved. XVoice is distributed
subject to the terms explained in the LICENSE file. Portions of this program
are based on code from other projects, which use the GPL and other licenses.
All attributions are in the AUTHORS file.

Short Version: GPL.