The correspondence between orthogra

The correspondence between orthography and pronunciation
in Modern Standard Arabic (MSA) falls
somewhere between that of languages such as Spanish
and Finnish, which have an almost one-to-one
mapping between letters and sounds, and languages
such as English and French, which exhibit a more
complex letter-to-sound mapping (El-Imam, 2004).
The more complex this mapping is, the more difficult
the language is for Automatic Speech Recognition
(ASR).
An essential component of an ASR system is its
pronunciation dictionary (lexicon), which maps the
orthographic representation of words to their phonetic
or phonemic pronunciation variants. For languages
with complex letter-to-sound mappings, such
dictionaries are typically written by hand. However,
for morphologically rich languages, such as MSA,1
pronunciation dictionaries are difficult to create by
hand, because of the large number of word forms,
each of which has a large number of possible pronunciations.
Fortunately, the relationship between
orthography and pronunciation is relatively regular
and well understood for MSA. Moreover, recent
automatic techniques for morphological analysis
and disambiguation (MADA) can also be useful
in automating part of the dictionary creation process
(Habash and Rambow, 2005; Habash and Rambow,
2007) Nonetheless, most documented Arabic ASR
systems appear to handle only a subset of Arabic
phonetic phenomena; very few use morphological
disambiguation tools.
In Section 2, we briefly describe related work, including
the baseline system we use. In Section 3, we
outline the linguistic phenomena we believe are critical
to improving MSA pronunciation dictionaries.
In Section 4, we describe the pronunciation rules we
have developed based upon these linguistic phenomena.
In Section 5, we describe how these rules are
used, together with MADA, to build our pronunciation
dictionaries for training and decoding automatically.
In Section 6, we present results of our evaluations
of our phone- and word-recognition systems
(XPR and XWR) onMSA compa

0/5000

From: -

To: -

Results (Thai) 1: [Copy]

Copied!

The correspondence between orthography and pronunciationin Modern Standard Arabic (MSA) fallssomewhere between that of languages such as Spanishand Finnish, which have an almost one-to-onemapping between letters and sounds, and languagessuch as English and French, which exhibit a morecomplex letter-to-sound mapping (El-Imam, 2004).The more complex this mapping is, the more difficultthe language is for Automatic Speech Recognition(ASR).An essential component of an ASR system is itspronunciation dictionary (lexicon), which maps theorthographic representation of words to their phoneticor phonemic pronunciation variants. For languageswith complex letter-to-sound mappings, suchdictionaries are typically written by hand. However,for morphologically rich languages, such as MSA,1pronunciation dictionaries are difficult to create byhand, because of the large number of word forms,each of which has a large number of possible pronunciations.Fortunately, the relationship betweenorthography and pronunciation is relatively regularand well understood for MSA. Moreover, recentautomatic techniques for morphological analysisand disambiguation (MADA) can also be usefulin automating part of the dictionary creation process(Habash and Rambow, 2005; Habash and Rambow,2007) Nonetheless, most documented Arabic ASRsystems appear to handle only a subset of Arabicphonetic phenomena; very few use morphologicaldisambiguation tools.In Section 2, we briefly describe related work, includingthe baseline system we use. In Section 3, weoutline the linguistic phenomena we believe are criticalto improving MSA pronunciation dictionaries.In Section 4, we describe the pronunciation rules wehave developed based upon these linguistic phenomena.In Section 5, we describe how these rules areused, together with MADA, to build our pronunciationdictionaries for training and decoding automatically.In Section 6, we present results of our evaluationsof our phone- and word-recognition systems(XPR and XWR) onMSA compa

Being translated, please wait..

Results (Thai) 2:[Copy]

Copied!

จดหมายระหว่างการสะกดการันต์และการออกเสียง
ในภาษาอาหรับมาตรฐานสมัยใหม่ (MSA) ตกอยู่
ที่ไหนสักแห่งระหว่างที่ของภาษาเช่นภาษาสเปน
และฟินแลนด์ซึ่งมีเกือบหนึ่งต่อหนึ่ง
การทำแผนที่ระหว่างตัวอักษรและเสียงและภาษา
เช่นภาษาอังกฤษและฝรั่งเศสซึ่งจัดแสดง ขึ้น
ซับซ้อนตัวอักษรต่อเสียงทำแผนที่ (El-อิหม่าม, 2004).
ที่มีความซับซ้อนมากขึ้นการทำแผนที่นี้เป็นที่ยากขึ้น
เป็นภาษาสำหรับการรู้จำเสียงอัตโนมัติ
(ASR).
เป็นองค์ประกอบสำคัญของระบบ ASR เป็นของ
พจนานุกรมการออกเสียง ( พจนานุกรม) ซึ่งแมป
เป็นตัวแทนของการสะกดคำการออกเสียงของพวกเขา
สายพันธุ์การออกเสียงหรือสัทศาสตร์ สำหรับภาษา
ที่มีตัวอักษรแมปเพื่อเสียงที่ซับซ้อนเช่น
พจนานุกรมถูกเขียนด้วยมือโดยทั่วไป อย่างไรก็ตาม
สำหรับภาษาที่อุดมไปด้วยสัณฐานเช่น MSA 1
พจนานุกรมการออกเสียงเป็นเรื่องยากที่จะสร้างโดย
มือเนื่องจากมีจำนวนมากของรูปแบบคำ
แต่ละที่มีจำนวนมากของการออกเสียงที่เป็นไปได้.
โชคดีที่ความสัมพันธ์ระหว่าง
การสะกดการันต์และการออกเสียงเป็น ค่อนข้างปกติ
และเป็นที่เข้าใจกันดีสำหรับ MSA นอกจากนี้ที่ผ่านมา
เทคนิคอัตโนมัติสำหรับการวิเคราะห์ลักษณะทางสัณฐานวิทยา
และวิกิพีเดีย (MADA) ยังสามารถเป็นประโยชน์
ในการทำงานอัตโนมัติเป็นส่วนหนึ่งของกระบวนการสร้างพจนานุกรม
(Habash และ Rambow 2005; Habash และ Rambow,
2007) อย่างไรก็ตามเอกสารส่วนใหญ่อาหรับ ASR
ระบบปรากฏจัดการเพียง ย่อยของภาษาอาหรับ
ปรากฏการณ์ออกเสียง; การใช้งานน้อยมากก้าน
เครื่องมือแก้ความกำกวม.
ในส่วนที่ 2 เราอธิบายงานที่เกี่ยวข้องรวมทั้ง
ระบบพื้นฐานที่เราใช้ ในข้อ 3 เรา
ร่างปรากฏการณ์ทางภาษาที่เราเชื่อว่ามีความสำคัญ
ในการปรับปรุง MSA พจนานุกรมการออกเสียง.
ในมาตรา 4 เราจะอธิบายกฎการออกเสียงที่เรา
ได้มีการพัฒนาขึ้นอยู่กับปรากฏการณ์ทางภาษาเหล่านี้.
ในมาตรา 5 เราจะอธิบายว่ากฎเหล่านี้จะถูก
นำมาใช้ร่วมกัน กับ MADA เพื่อสร้างการออกเสียงของเรา
พจนานุกรมสำหรับการฝึกอบรมและการถอดรหัสโดยอัตโนมัติ.
ในมาตรา 6 ที่เรานำเสนอผลการประเมินของเรา
ของการถือโทรศัพท์และคำรับรู้ระบบของเรา
(XPR และ XWR) compa onMSA

Being translated, please wait..

Results (Thai) 3:[Copy]

Copied!

ความสอดคล้องระหว่างเนื้อหาและการออกเสียงภาษาอาหรับมาตรฐานสมัยใหม่ ( MSA ) ตกระหว่างที่ของภาษา เช่น สเปนและฟินแลนด์ ซึ่งมีคนเกือบแผนที่ระหว่างตัวอักษรและเสียง และ ภาษาเช่นภาษาอังกฤษและภาษาฝรั่งเศสซึ่งแสดงเพิ่มเติมหนังสือเสียงที่ซับซ้อนการทำแผนที่ ( El อิหม่าม , 2004 )ที่ซับซ้อนมากขึ้น แมปนี้ คือ ยากภาษาสำหรับการรู้จำเสียงพูดอัตโนมัติ( ASR )องค์ประกอบที่สำคัญของระบบ ASR ของมันคือพจนานุกรมการออกเสียง ( 2 ) ซึ่งแผนที่เรื่องของการออกเสียงของคำแทนหรือการเปรียบเทียบพันธุ์ สำหรับภาษาจดหมายที่ซับซ้อนเสียงแมปดังกล่าวพจนานุกรมมักจะเขียนโดยมือ อย่างไรก็ตามจากที่อุดมไปด้วยสำหรับภาษาเช่น MSA , 1พจนานุกรมการออกเสียงก็ยากที่จะสร้างโดยมือ เพราะของจำนวนมากของรูปแบบคำแต่ละที่มีเป็นจำนวนมากของการออกเสียงที่เป็นไปได้โชคดีที่ความสัมพันธ์ระหว่างอักขรวิธี การออกเสียงค่อนข้างปกติและเข้าใจได้ดีสำหรับ MSA นอกจากนี้ ล่าสุดเทคนิคอัตโนมัติสำหรับการวิเคราะห์ก้านและการแก้ความกำกวม ( ตะกอน ) นอกจากนี้ยังสามารถเป็นประโยชน์ในส่วนของขั้นตอนการสร้างพจนานุกรมโดยอัตโนมัติ( และ บาช rambow , 2005 ; และ rambow บาช ,2007 ) ซึ่งส่วนใหญ่จะเป็นเอกสารภาษาอังกฤษขั้นสูงระบบจะปรากฏที่จะจัดการเฉพาะชุดย่อยของภาษาอาหรับปรากฏการณ์การออกเสียง ; น้อยมากที่ใช้โดยเครื่องมือเว็บมาสเตอร์ .ในส่วนที่ 2 เราจะอธิบายงานที่เกี่ยวข้อง ได้แก่พื้นฐานของระบบที่เราใช้ ในมาตรา 3 ให้เราร่างที่เราเชื่อว่ามีปรากฏการณ์ทางภาษาการปรับปรุงการปรับปรุงการออกเสียงพจนานุกรมในส่วนที่ 4 เราอธิบายการออกเสียงกฎเราได้พัฒนาตามปรากฏการณ์ทางภาษาเหล่านี้ในส่วนที่ 5 เราอธิบายว่า กฎเหล่านี้ใช้ร่วมกับตะกอน , การสร้างการออกเสียงของเราพจนานุกรมสำหรับการฝึกอบรมและถอดรหัสโดยอัตโนมัติในมาตรา ๖ เราเสนอผลการประเมินของเราโทรศัพท์ของเรา และระบบจดจำคำ( xpr และ xwr ) onmsa compa

Being translated, please wait..

Other languages

The translation tool support: Afrikaans, Albanian, Amharic, Arabic, Armenian, Azerbaijani, Basque, Belarusian, Bengali, Bosnian, Bulgarian, Catalan, Cebuano, Chichewa, Chinese, Chinese Traditional, Corsican, Croatian, Czech, Danish, Detect language, Dutch, English, Esperanto, Estonian, Filipino, Finnish, French, Frisian, Galician, Georgian, German, Greek, Gujarati, Haitian Creole, Hausa, Hawaiian, Hebrew, Hindi, Hmong, Hungarian, Icelandic, Igbo, Indonesian, Irish, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Kinyarwanda, Klingon, Korean, Kurdish (Kurmanji), Kyrgyz, Lao, Latin, Latvian, Lithuanian, Luxembourgish, Macedonian, Malagasy, Malay, Malayalam, Maltese, Maori, Marathi, Mongolian, Myanmar (Burmese), Nepali, Norwegian, Odia (Oriya), Pashto, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Samoan, Scots Gaelic, Serbian, Sesotho, Shona, Sindhi, Sinhala, Slovak, Slovenian, Somali, Spanish, Sundanese, Swahili, Swedish, Tajik, Tamil, Tatar, Telugu, Thai, Turkish, Turkmen, Ukrainian, Urdu, Uyghur, Uzbek, Vietnamese, Welsh, Xhosa, Yiddish, Yoruba, Zulu, Language translation.