Note that not all features discussed in this file are included in the standard distribution. HISTORY June 21st 2001 1.4.2 Release Various new gcc's support Visual C++ 6.0 support uses configure (though could do so even more) substantial updates to the clunits unit selection module lots of wee bugs fixed a few very hard bugs fixed (client/server race condition) (dropped bytes in reading files when machine overloaded) (FreeBSD memory/gc problem) default waverform type is now RIFF. Nov 21st 1999 1.4.1 Release SSFF (for emulabel) track support AIX support Java fixes various minor bug fixes WFST with proper quoting Wagon sample counts gcc-2.95.1 support June 20th 1999 1.4.0 Release becomes free software June 6th 1999 1.3.95 Beta size/speed/memory leak overhaul (no memory leaks) XML support for relation loading (for SOLE support) JSAPI initial support GalaxyCommunicator architecture interface ked_mttilt_diphone voice built Parser trained on MARSEC (prosodic) brackets rather than syntax Unisyn_selection fully integrated Unisyn_phonology fully integrated viterbi cart/ngram/wfst base LTS prediction (did improve but BIG) viterbi cart/ngram based accent prediction (didn't improve) tilt working (again) audioin (na_record) for many architectures viterbi from Scheme (with cart, ngram, wfst models) January 26th 1999 1.3.1 Release egcs-1.1.1 support tobi_rules update (GM) replace readline with editline (+ extensions) Lots of little bug fixes cluster code tidied up kal voice ked power normalization updated lexicons with addenda for US and UK New LTS models for US and UK English "Building Voices in Festival" document August 24th 1998 1.3.0 Release UniSun/groupfile optimizations Java client support Fixed ESPS so both track and wave output works Retraining of most modules with new architecture (durations improved) rxp, (Richard's XML parser) integrated and Sable XMLified Fringe display program for labels and utterances Metrical tree synthesis A new utterance architecture (Relations and Items) utterance save and load work properly now Trainable LTS system Lexicon cache system Substantial optimization of front end (twice the speed) UniSyn, new signal processing and generic waveform synthesis module OLS code added WFST support for kk rules, regular grammars etc, simple English morphology November 30th 1997 1.2.4 BETA Tilt analysis and Tilt intonation modules added. make_utts substantially improved (> 100 times faster) text2wave script added Pitch synchronous lpc analysis and support rab consonant clusters labelled New duration tree (wagon stepwise) much smaller if not better SCFG grammar and parser (scfg_parse_text added as festival script) change config stuff (again) October 1st 1997 1.2.1 RELEASE preliminary support for Visual C++ Use path-append rather than string-append (in buckets of places) Minor bugs fixes throughout the code (end silences are now *always* inserted in tts) Linux socket bug fixed (get_url didn't work) native irix audio support September 5th 1997 1.2.0 RELEASE Proclaim modules and voices automatic detection of voices Phonset, lexicon, ltsrules listing and printing 16 bit linear native support for Solaris i386 (sb16) Update Festival Tutorial to 1.2.0 Aygust 15th 1997 1.1.99 Beta release Win NT (and 95) initial support Cygnus win32 and Visual C++ 100 more pages of documentation LPC analysis for voices now ESPS independent Spanish el voices tidy up (Borja) ToBI by rule implementation Confirmed support for gcc-2.7.2, gcc-2.6.3, Linux, FreeBSD, SunOS Alpha and SunCC port on Solaris reference card added return s-expressions in server/client mode OGI markup mode added. Native support for sun16, linux16 and freebsd (compile time option) Changed names of .C files to .cc files for bILL wagon integrated into speech tools (plus docs) auto-text-mode-alist for automatic selection of text mode from file name Associated token tests added Many more tokens dealt with (numbers, money, roman, phone, etc.) (analysed databases to see what coverage is like) A probablistic chart parser (no significant grammars though) RJC's new database/units/join/modify modules taking shape Some more examples added to the tutorial (with answers) Integrated CSLU changes for OGItoolkit including TCL support stml support for phrase types and words inline ssml -> stml Postlexical rules done in Scheme rather than C++ Rest of functions to allow any manipulation of utterance from scheme New duration models trained for both English and American New lexicon (CMU based) Consonant cluster support (for kd) American diphone set Cluster unit selection algorithm more robust Ngram backoff smoothing Token pos, for numbers (97.5%) but does poor on phone numbers New lexicon with final Rs and r deletion as postlex rule Update pos prediction (ts39) and phrase break ngrams (faster to load) New ngram format (binary files, and smoothing) Vowel reduction module Sun CC port New string class (rjc) remove dependence on libg++ Update of course notes and new section on building models from dbs Yarowsky homograph disambiguation Jan 24th 1997 1.1.1 release (first public release) a number of configuration and INSTALL documentation bugs fixed SSML tidied up and a festival script provided for it. Diphones, again, checked and copyright explicitly added Jan 6th 1997 1.1.0 release Roger diphones now default speaker A new unit clustering algorithm with acoustic costs and optimal coupling BSD socket client/server support A format function in Scheme (fprintf-like) A short course on Speech Synthesis in Festival (with course notes and exercises) A programmable form of text modes including externally customizable token to word rules. Fully programmable intonation module (for ToBI-like theories) Backtrace facility in Lisp Externally specified Utterance end (for all tts modes) with lookahead Roger diphones, first draft Nov 8th 1.0.0 release Substantial bug fixes, stabilization and documentation updates Added residual excited LPC synthesizer and removed PSOLA code. Made sucs and taylor optional modules, new modules can be added without modifying the base code MOBY lexicon (not as good as cuvoald but free) New diphone grouping software A new diphone database module (free from adc) Sept 30th 0.1 release MBROLA support (good example of external module) latest news: read out the latest news (from Time Warner, Pathfinder) audio spooler --language option on command line Spanish synthesis Letter to sound rules as external system (replacing all the NRL code) Welsh synthesis, making the whole system more language independent sucs spoke in reasonably way (gsw_450 and f2b dbs) document strings for functions (built in and user) and variables access from command line and dumped automatically into texinfo cleaned up SSML implementation break prediction integrated using viterbi and pos sucs module started (selection of units for concatenative synthesis) a part-of-speech tagging system (ngram/viterbi based) viterbi code added fixes in SIOD for running batch and stdin, also sub_prompts added saytime example Memory leaks fixed, no leaks for tts July 30th 0.0 release (just for the sake of it) a significiant start at documentation (texinfo -> info & html) festival scripts using #! on first line donovan diphone support can compile (with too many warnings) under g++ 2.7.2 copyrights on all files memory leak checks (only 8 bytes for "unknown" words) SSML (and tts file modes) cuvoald cmu and beep lexicons lexicon compilation web page, emacs interface June 2nd Klatt duration module syllabification in phones from letter to sound rules Linear Regression model for F0 prediction (from ToBI labels) CART (wagon) built trees for duration (zscores), phrase boundaries, accent and endtone prediction. ffeatures allowing specification of features of an utterance May integrated Taylor diphone module US Naval Research letter to sound rules CSTR lexicon 12th April first words "hello" start with speech_tools library, scheme-in-one-defun and readline and external CSTR diphone synthesizer 7th April 1996 work started