-
1
Authors: Yuma Koizumi, Kohei Yatabe, Heiga Zen, Michiel Bacchiani
Subject terms: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Machine Learning, Audio and Speech Processing (eess.AS), Statistics - Machine Learning, FOS: Electrical engineering, electronic engineering, information engineering, Machine Learning (stat.ML), Computer Science - Sound, Machine Learning (cs.LG), Electrical Engineering and Systems Science - Audio and Speech Processing
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::806a1952bf9d1889dbb9e0babbc78004
http://arxiv.org/abs/2210.01029
-
2
Authors: Michael L. Seltzer, Reinhold Haeb-Umbach, Shinji Watanabe, Bjorn Hoffmeister, Heiga Zen, Michiel Bacchiani, Mehrez Souden, Tomohiro Nakatani
Source: IEEE Signal Processing Magazine. 36:111-124
Subject terms: Signal processing, business.industry, Computer science, Applied Mathematics, Speech recognition, Deep learning, Interface (computing), 020206 networking & telecommunications, Speech synthesis, 02 engineering and technology, computer.software_genre, Speech processing, Signal Processing, 0202 electrical engineering, electronic engineering, information engineering, Use case, Loudspeaker, Artificial intelligence, Electrical and Electronic Engineering, business, computer, Spoken language
-
3
Authors: Llion Jones, Michiel Bacchiani, Yotaro Kubo, Shigeki Karita
Subject terms: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Computation and Language, Computer science, Character (computing), Speech recognition, Training methods, Computer Science - Sound, Tokenization (data security), Connectionism, Moving average, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Noise (video), Computation and Language (cs.CL), Word (computer architecture), Electrical Engineering and Systems Science - Audio and Speech Processing
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::fa3b42d54fba63daa5b94328fa765023
-
4
Authors: Yotaro Kubo, Michiel Bacchiani
Source: ICASSP
Subject terms: Dependency (UML), Computer science, Generalization, Speech recognition, 05 social sciences, SIGNAL (programming language), Grapheme, Multi-task learning, 010501 environmental sciences, 01 natural sciences, Task (project management), 0502 economics and business, 050207 economics, Encoder, 0105 earth and related environmental sciences
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_________::de172df60a54526dda191d7b10157318
https://doi.org/10.1109/icassp40776.2020.9054557
-
5
Authors: Kevin W. Wilson, Kean Chin, Chanwoo Kim, Bo Li, Ananya Misra, Ron Weiss, Ehsan Variani, Andrew W. Senior, Tara N. Sainath, Izhak Shafran, Michiel Bacchiani, Arun Narayanan
Source: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25:965-979
Subject terms: Beamforming, Acoustics and Ultrasonics, Time delay neural network, Computer science, Speech recognition, Direction of arrival, Word error rate, 020206 networking & telecommunications, 02 engineering and technology, Filter (signal processing), Speech processing, Filter bank, Speech enhancement, 030507 speech-language pathology & audiology, 03 medical and health sciences, Computational Mathematics, 0202 electrical engineering, electronic engineering, information engineering, Computer Science (miscellaneous), Electrical and Electronic Engineering, 0305 other medical science
-
6
Authors: Arun Narayanan, Galen Chuang, Zhongdi Qu, Rohit Prabhavalkar, Neeraj Gaur, Parisa Haghani, Pedro J. Moreno, Michiel Bacchiani, Austin Waters
Source: SLT
Subject terms: FOS: Computer and information sciences, Sound (cs.SD), Computer science, Natural language understanding, Word error rate, 010501 environmental sciences, computer.software_genre, Semantics, 01 natural sciences, Computer Science - Sound, Domain (software engineering), Set (abstract data type), 030507 speech-language pathology & audiology, 03 medical and health sciences, Audio and Speech Processing (eess.AS), Argument, FOS: Electrical engineering, electronic engineering, information engineering, 0105 earth and related environmental sciences, Computer Science - Computation and Language, business.industry, Task analysis, Artificial intelligence, 0305 other medical science, business, Computation and Language (cs.CL), computer, Natural language processing, Electrical Engineering and Systems Science - Audio and Speech Processing, Spoken language
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d1b2db66c60a737a66d9d88c906e0de7
https://doi.org/10.1109/slt.2018.8639043
-
7
Authors: Khe Chai Sim, Mohamed G. Elfeky, Trevor Strohman, Ananya Misra, Michiel Bacchiani, Arun Narayanan, Anshuman Tripathi, Golan Pundak, Parisa Haghani
Source: SLT
Subject terms: FOS: Computer and information sciences, Computer Science - Computation and Language, Noise measurement, Computer science, Speech recognition, Feature extraction, 020206 networking & telecommunications, 02 engineering and technology, Data modeling, Background noise, 030507 speech-language pathology & audiology, 03 medical and health sciences, Sampling (signal processing), Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, 0202 electrical engineering, electronic engineering, information engineering, Codec, Invariant (mathematics), 0305 other medical science, Computation and Language (cs.CL), Utterance, Electrical Engineering and Systems Science - Audio and Speech Processing
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a6f8761113c076ea769bce14469f25b5
https://doi.org/10.1109/slt.2018.8639610
-
8
Authors: Bo Li, Tara N. Sainath, Khe Chai Sim, Parisa Haghani, Anshuman Tripathi, Ananya Misra, Golan Pundak, Arun Narayanan, Michiel Bacchiani
Source: INTERSPEECH
Subject terms: 030507 speech-language pathology & audiology, 03 medical and health sciences, Domain adaptation, Computer science, Speech recognition, 0202 electrical engineering, electronic engineering, information engineering, 020206 networking & telecommunications, 02 engineering and technology, Hidden layer, 0305 other medical science
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_________::0e1980f15272c878d98b40c40ce369a4
https://doi.org/10.21437/interspeech.2018-2246
-
9
Authors: Anjali Menon, Richard M. Stern, Michiel Bacchiani, Chanwoo Kim
Source: ICASSP
Subject terms: Reverberation, Channel (digital image), Noise measurement, Computer science, Acoustic model, 020206 networking & telecommunications, 02 engineering and technology, Weighting, 030507 speech-language pathology & audiology, 03 medical and health sciences, Noise, Angle of arrival, 0202 electrical engineering, electronic engineering, information engineering, 0305 other medical science, Algorithm
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_________::bee40e87ece14c610381bd1b7d6fcc5e
https://doi.org/10.1109/icassp.2018.8462269
-
10
Authors: Michiel Bacchiani, Tom Bagby, Kamel Lahouel, Erik McDermott, Ehsan Variani
Source: ICASSP
Subject terms: Logarithm, business.industry, Computer science, Entropy (statistical thermodynamics), Sampling (statistics), Pattern recognition, 02 engineering and technology, 01 natural sciences, Upper and lower bounds, Cross entropy, Connectionism, Bias of an estimator, 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering, Entropy (information theory), 020201 artificial intelligence & image processing, Artificial intelligence, Entropy (energy dispersal), 010306 general physics, business, Entropy (arrow of time)
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_________::82e53064794a8122ac2433f604b4c9d8
https://doi.org/10.1109/icassp.2018.8461929