Re: [gnuspeech-contact] introduction and gnuspeech questions

gnuspeech-contact

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [gnuspeech-contact] introduction and gnuspeech questions

From:	D.R. Hill
Subject:	Re: [gnuspeech-contact] introduction and gnuspeech questions
Date:	Mon, 14 Nov 2005 14:27:56 -0700 (MST)

Hi Eric,

Many thanks for your email and your strong interest in helping with the"gnuspeech" porting work.


I have replied to your questions in context.

I will help you in any way I can. Feel free to keep bugging me. It willbe a big help to have you improving and extending the port. Thank you.


All good wishes.

david


David Hill, Prof. Emeritus, Computer Science  |  Imagination is more       |
U. Calgary, Calgary, AB, Canada T2N 1N4       |  important than knowledge  |
address@hidden OR address@hidden   |         (Albert Einstein)  |
http://www.cpsc.ucalgary.ca/~hill             |  Kill your television      |
----

On Mon, 14 Nov 2005, Eric Zoerner wrote:

I am looking at gnuspeech and am considering getting involved in the portingand/or development effort. I am a postgraduate student at Trinity College,Dublin, doing a PhD in speech science. I have 17 years of commercialsoftware engineering experience working in Smalltalk, Java, C/C++, et. al. onUnix/Linux and Mac platforms. I am potentially interested in working withgnuspeech to explore voice control in speech synthesis as my dissertationtopic. I am being funded to do speech synthesis for the Irish language. Whilethe primary effort at this point is for a unit concatenative voice, I mayalso investigate what it would take to add synthesis for Irish to gnuspeech.
I have not yet downloaded the software from the cvs repository, but have beenreading some of the documentation. I have just a couple of questions at thispoint for the development team. The platform I would be interested in runninggnuspeech on and doing development on is Mac OS/X Tiger.
Questions:
- is it possible to generate audio output from gnuspeech on Mac OS/X at thispoint?

Yes. The Monet system allows "British/American" English speech to besynthesised at the segmental level, and the intonation can be addedautomatically or manually, under both 10.3 and 10.4, as described in theMonet manual. Monet is the most important tool in constructing the rulesand other databases for new languages and dialects. Steve Nygard did theport and made some improvements to the original. He has had to move ontoother things for the time being, but he was an ideal person to do the workas he had not only worked on the original NeXTSTEP version, but had wideexperience of Mac OS/X and porting problems as a member of OmniGroup.The Monet system is intended for interactive design of the speechdatabases. We need to make a stripped down version that can be part of aspeech server/daemon to provide speech as a service -- so-called "RealtimeMonet". There's a diagram on the savannah gnuspeech website that showsthe relationships.

I say "British/American" English because the basic data is British English(vowel and consonant targets, rhythm and intonation) but concessions weremade to American pronunciation in the dictionary entries -- especially theuse of somewthing approaching the American "r" sound. I was against that,but the majority, at the time, seemed to be convinced that with the USmarket as the biggest market, we should use a more "American" accent. Thisreally ought to be changed. Many Americans actually like a proper Britishaccent.

Also we need to port "Synthesizer", which is a GUI front end for the tubemodel. This is used to gain understanding of the tube model itself byproviding direct access to all the parameters, and it allows vocalconfigurations needed for a new language or dialect to be pinned down andquantified as a basis for creating posture data entries in the Monetsystem.

[Greg Casamento has modified (added to) the code that Steve Nygarddeveloped so that it can also be compiled for GNU/Linux, but there's stilla little work on that to be done. I haven't heard from Greg for a while.

Adam Fedor took the original NeXT 3.3 code and started the porting bymaking the changes appropriate to OpenStep, which was a good startingpoint for Steve's OS/X port.]

Finally, the dictionary editing tools should be ported and improved. Thisis a fairly small job.

- Are there any sound files available that give samples of the speech output?

Yes, I have attached a comparitive synthesis of "Hello" by male, femaleand child voices from the system. I can send more stuff, includingsentences and so on if you like.

- Is it still possible to get the software/hardware necessary to run the fullsuite of gnuspeech (i.e. NEXTstep?)?

Yes, it is on the savannah site, in the repository. The originalDeveloper kit and in-house tools software are there. You will need apassword to install and run the Developer kit, and you will find a bunchof valid passwords in the file "serial-nos.txt" also attached (you need a"Developer" serial number for the Developer kit -- 2nd half). [Hm! Ichecked the repository and, for some obscure reason, the equivalent file"gnuspeech/gnuspeech/trillium/priv/SerialNumbers/file" has been truncated.I'll have to fix that right away.] I probably should re-organise therepository to make it much more clear what is available, and add some ofthe oddments that run on the NeXT, like dictionary editing tools andinformation on the dictionary entry formats. The dictionary in the Userand Developer kits is encoded, but there is no longer any reason to dothis.

You should find the package for the most recent release of the NeXT 3.xDeveloper kit (release is 2.01) in"gnuspeech/gnuspeech/trillium/TextToSpeechKit/Release2.01/Dev" in therepository.


The Mac OS/X stuff is in the directory "current" rather than "gnuspeech".

As I say, feel free to bug me with any questions, requests or whatever.
Let me know if any other repository stuff appears to be damaged.

david

 >

Thanks for any info you can provide.

Eric Zoerner
School of Linguistic,  Speech and Communication Sciences
Trinity College Dublin


_______________________________________________
gnuspeech-contact mailing list
address@hidden
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact

helloComparison.snd
Description: Basic audio

serial-nos.txt
Description: Text document

[Prev in Thread]

Current Thread

[Next in Thread]

[gnuspeech-contact] introduction and gnuspeech questions, Eric Zoerner, 2005/11/14
- Re: [gnuspeech-contact] introduction and gnuspeech questions, D.R. Hill <=
  - [gnuspeech-contact] Re: introduction and gnuspeech questions, Eric Zoerner, 2005/11/14
    - Re: [gnuspeech-contact] Re: introduction and gnuspeech questions, D.R. Hill, 2005/11/14

Prev by Date: [gnuspeech-contact] introduction and gnuspeech questions
Next by Date: [gnuspeech-contact] Re: introduction and gnuspeech questions
Previous by thread: [gnuspeech-contact] introduction and gnuspeech questions
Next by thread: [gnuspeech-contact] Re: introduction and gnuspeech questions
Index(es):
- Date
- Thread