Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
485446 | Procedia Computer Science | 2016 | 8 Pages |
Abstract
We investigate modeling strategies for English code-switched words as found in a Swahili spoken term detection system. Code switching, where speakers switch language in a conversation, occurs frequently in multilingual environments, and typically de- teriorates STD performance. Analysis is performed in the context of the IARPA Babel program which focuses on rapid STD system development for under-resourced languages. Our results show that approaches that specifically target the modeling of code-switched words, significantly improve the detection performance of these words.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science (General)
Authors
Neil Kleynhans, William Hartman, Daniel van Niekerk, Charl van Heerden, Rich Schwartz, Stavros Tsakalidis, Marelie Davel,