Discussion:
[Freeswitch-users] pocketsphinx
Helmut Kuper
2009-07-10 12:53:50 UTC
Permalink
Hi,

I am trying to change pocketsphinx's grammar from the default (English) to German.
I found this archive
(http://www.repository.voxforge1.org/downloads/de/Trunk/AcousticModels/), which
contains files similar to those found in the
grammar/model/communicator directory.

Unfortunately, FS crashes without writing a core file or any log file entries
as soon as the pizza demo tries to detect speech.

Any ideas? Maybe someone already has working grammar/model files for the
German language?


regards
helmut
Peter P GMX
2009-07-10 14:12:46 UTC
Permalink
Hello Helmut,

I looked at these dic files. Their content (look at all the qq's) is
quite different from the dic files supplied with FreeSWITCH's pocketsphinx.
As I remember, the CMU dict file format changed in April 2008.

Best regards
Peter
_______________________________________________
Freeswitch-users mailing list
Freeswitch-users at lists.freeswitch.org
http://lists.freeswitch.org/mailman/listinfo/freeswitch-users
UNSUBSCRIBE:http://lists.freeswitch.org/mailman/options/freeswitch-users
http://www.freeswitch.org
Brian West
2009-07-10 14:24:01 UTC
Permalink
Yes, you have to make sure you use the one that comes with PocketSphinx
and not the 7.x one you download from the website. They weren't
compatible last I checked.

/b
Peter P GMX
2009-07-10 14:24:13 UTC
Permalink
Hello Helmut,

I looked at these dic files. Their content (look at all the qq's) is
quite different from the dic files supplied with FreeSWITCH's pocketsphinx.
As I remember, the CMU dict file format changed in April 2008.
Maybe there is a converter somewhere?

I was thinking of just extending the current dict file with the German
words I need, but I have not tested it so far. This should be possible
without modifying the underlying grammar.
http://en.wikipedia.org/wiki/CMU_Pronouncing_Dictionary
I would love to hear about any progress you make on this.
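
A minimal sketch of the idea of extending the existing dictionary, assuming the plain-text layout used by CMU-style pronouncing dictionaries: one entry per line, the word first, followed by its whitespace-separated phones. The phone strings for the names below are illustrative guesses, not tested pronunciations, and `parse_dict` and `merge_entries` are hypothetical helpers, not part of pocketsphinx or FreeSWITCH.

```python
def parse_dict(text):
    """Parse dictionary text into {word: [phones, ...]}; later duplicates are ignored."""
    entries = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith(";;;"):  # skip blank lines and comment lines
            continue
        word, *phones = line.split()
        entries.setdefault(word, phones)
    return entries


def merge_entries(base_text, extra_text):
    """Return (merged dictionary text, list of words that were skipped).

    Words already present in the base dictionary are left untouched and
    reported, so an existing pronunciation is never silently replaced.
    """
    base = parse_dict(base_text)
    skipped = []
    for word, phones in parse_dict(extra_text).items():
        if word in base:
            skipped.append(word)
        else:
            base[word] = phones
    lines = [" ".join([word] + phones) for word, phones in sorted(base.items())]
    return "\n".join(lines) + "\n", skipped


# Hypothetical sample data; the phone strings are guesses for illustration only.
BASE = "HELLO HH AH L OW\nJAN JH AE N\n"
EXTRA = "JAN Y AA N\nGERD G EH R T\n"

merged, skipped = merge_entries(BASE, EXTRA)
print(merged, end="")
print("already present:", skipped)
```

Reporting the skipped words rather than overwriting them makes it visible when a new word clashes with an entry the dictionary already contains.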

Best regards
Peter
Helmut Kuper
2009-07-13 07:53:51 UTC
Permalink
Hi Peter,

Hmm, well, I had the same idea and I tested it! But you have to
make sure that the English grammar/acoustic model is able to cover all
German sounds. E.g. I tried to detect "Burke", "Jan" and "Gerd". I was
able to map "Burke" successfully in default.dic; otherwise I had to say
"B?hrki". I did the same with "Jan", but when I tried to detect "Jan"
I always got "Gerd" (with and without the mapping in default.dic). Quite
strange, and not really usable for (German) customers.
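
Part of the coverage problem can be checked mechanically. A rough sketch, assuming the dictionary is plain text with one word and its phones per line, and that the phone set of the acoustic model is available as a simple list. The phone names, the sample entries and the `uncovered_phones` helper are made up for illustration and are not taken from the communicator model.

```python
def uncovered_phones(dict_text, model_phones):
    """Map each word to the phones it uses that the acoustic model does not define."""
    known = set(model_phones)
    problems = {}
    for line in dict_text.splitlines():
        parts = line.split()
        if len(parts) < 2:
            continue  # blank line, or a word without a pronunciation
        word, phones = parts[0], parts[1:]
        missing = [p for p in phones if p not in known]
        if missing:
            problems[word] = missing
    return problems


# Hypothetical phone set and entries; "OE" stands in for a German vowel that
# an English model would not contain.
MODEL_PHONES = ["B", "ER", "K", "G", "EH", "R", "T", "Y", "AA", "N", "L"]
DICT_TEXT = "BURKE B ER K\nGERD G EH R T\nKOELN K OE L N\n"

print(uncovered_phones(DICT_TEXT, MODEL_PHONES))
```

An entry that uses only known phones can still be recognised poorly, as with "Jan"; this check only catches phones the model does not define at all.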

But some typical software magic happened along the way:
During my tests I somehow had a configuration using the voxforge files
that worked within FS, but I can't reproduce it. I changed
several files at the same time and reloaded with "reloadxml" and
"reload mod_pocketsphinx" instead of restarting FS. When it worked, FS
was able to detect "Burke", "Jan" and "Gerd" correctly without modifying
the dictionary...


Is there any manual about pocketsphinx and its config files that
explains how PS works in more detail? Currently I am walking with a
flashlight in the dark ...

regards
Helmut
Peter P GMX
2009-07-13 12:55:00 UTC
Permalink
Hello Helmut,

The three words you mentioned are already part of the standard English
dictionary, so maybe this causes the problem? You could test with words
that are outside the standard grammar files, or delete the original entries.
So far I have no other documentation available. This part of
PocketSphinx is rather poorly documented, and for the FS part I have only
got some information from this mailing list.

Best regards
Peter
Helmut Kuper
2009-07-31 13:18:43 UTC
Permalink
Hello,

I spent a few days working on my problem. I played around with
voxforge's data, read some of CMU's training docs, and debugged
sphinxbase, pocketsphinx and mod_pocketsphinx.

Results:
- I found a way to use the voxforge data as training data for creating
a German language corpus.

- I enabled pocketsphinx's logging to stderr. (A dirty but easy way to
see what went wrong when FS loads the grammar, mdef, etc. and simply stops.
Very helpful!)

- I had to add a "dictcase" parameter to pocketsphinx.conf.xml and
mod_pocketsphinx.c to allow case-sensitive dictionaries (like the German
dictionary from voxforge).
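
Why case sensitivity matters can be shown with a small sketch. It assumes a dictionary in which words differ only by capitalisation, as can happen in German (for example "Weg", way, and "weg", away). The phone strings and the `case_collisions` helper are hypothetical and only illustrate which entries would clash if the dictionary were treated as case-insensitive.

```python
from collections import defaultdict


def case_collisions(dict_text):
    """Group dictionary words that become identical once case is ignored."""
    groups = defaultdict(list)
    for line in dict_text.splitlines():
        parts = line.split()
        if parts:
            groups[parts[0].lower()].append(parts[0])
    # Keep only groups in which at least two distinct spellings collide.
    return {key: words for key, words in groups.items() if len(set(words)) > 1}


# Hypothetical entries; the phones are illustrative only.
DICT_TEXT = "Weg v e: k\nweg v E k\nHaus h aU s\n"

print(case_collisions(DICT_TEXT))
```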


FS starts up with the German language model and detects the words as
expected, but it's not as reliable as I want it to be. I guess this is
caused by the very small amount of training audio data. I used 4000 of the
19000 audio files provided by voxforge because voxforge's
training fileid list contains only 4000 files. I have to create a new
fileid list and transcription list containing all the audio I have
downloaded from voxforge.
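
A sketch of generating such lists, assuming the layout commonly used by SphinxTrain: a fileid list with one audio path per line (without file extension) and a transcription file with one line per utterance of the form `<s> TEXT </s> (utterance_id)`. The prompt data, the path names and the `build_lists` helper are hypothetical; the real voxforge layout and the ID convention expected by the trainer should be checked against the CMU training documentation.

```python
import posixpath


def build_lists(prompts):
    """Build fileid and transcription lines from {audio_path: transcript}.

    The fileid is the path without its extension; the utterance ID placed
    in parentheses is assumed to be the file name without its directory.
    """
    fileids, transcriptions = [], []
    for path in sorted(prompts):
        fileid = posixpath.splitext(path)[0]
        utt_id = posixpath.basename(fileid)
        text = " ".join(prompts[path].split())  # normalise whitespace
        fileids.append(fileid)
        transcriptions.append(f"<s> {text} </s> ({utt_id})")
    return fileids, transcriptions


# Hypothetical prompts for two recordings.
PROMPTS = {
    "speaker1/wav/de1-01.wav": "guten  Morgen",
    "speaker1/wav/de1-02.wav": "eine Pizza bitte",
}

fileids, transcriptions = build_lists(PROMPTS)
print("\n".join(fileids))
print("\n".join(transcriptions))
```

Building both lists in the same loop keeps them in the same order, so each transcription line stays aligned with its fileid.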


Quite complex the whole thing ...

regards
helmut
Brian West
2009-07-31 13:37:30 UTC
Permalink
Post by Helmut Kuper
- I found a way to use the voxforge data as training data for creating
a German language corpus.

Care to document the process?

Post by Helmut Kuper
- I enabled pocketsphinx's logging to stderr. (A dirty but easy way to
see what went wrong when FS loads the grammar, mdef, etc. and simply stops.
Very helpful!)

I can't recall if there is a logger callback we can register for
this... Last I checked, you couldn't. This is something we should make a
config option for up in the mod if possible.

Post by Helmut Kuper
- I had to add a "dictcase" parameter to pocketsphinx.conf.xml and
mod_pocketsphinx.c to allow case-sensitive dictionaries (like the German
dictionary from voxforge).

What do you mean? Can you put this on Jira, please?
Michael Collins
2009-07-31 18:45:51 UTC
Permalink
Helmut,

Your hard work is appreciated. Like Brian said, we'd all be interested in
knowing more. Please feel free to put this on the wiki or see me off list
and we'll discuss further how to document it for the good of the FS
community.

Thanks!
-MC
Helmut Kuper
2009-08-05 14:56:30 UTC
Permalink
Hi Michael,

Today I put my documentation on the FS wiki, describing the steps to get an
8 kHz sample rate acoustic model based on voxforge's data for the German
language. It's not complete yet.

You can find it here: http://wiki.freeswitch.org/wiki/Mod_pocketsphinx

regards
Helmut
Michael Collins
2009-08-05 22:02:28 UTC
Permalink
Thanks!
I'd like to ask the community members who are interested in ASR and
PocketSphinx to please review Helmut's page and add to it as well as offer
feedback.
-MC