Difference between revisions of "Speech Reading Group"

Revision as of 19:25, 9 January 2013

This is the wiki page for topics to be discussed in the Speech Reading Group, starting Winter 2013.

The group is meant to offer an opportunity for people working on or interested in speech technology to interact and share ideas, with the goals of fostering a larger sense of community amongst speech researchers at UW, and allowing participants to become familiar with the breadth of research going on here and in the speech community at large.

We will cover any topics with some relation to speech research, including but not limited to:

Speech Recognition
Speech Production
Feature Extraction
Speech Enhancement/Separation
Auditory Scene Analysis
Speech Perception
Cognitive Psychology of Speech

Administrative Meeting

There will be an initial administrative meeting as follows:

When: Thursday, January 10, 4:00 pm

Where: CSE 303

This meeting will gauge interest in the group and its likely size, and cover administrative matters such as the organization of the group, meeting times, location, and paper schedule.

For questions or other matters, please contact either:

Gabe Schubiner - gabeos [at] cs.wash....edu
Scott Wisdom - swisdom [at] uw.edu

Potential Papers

Miranda, J., Neto, J., and Black, A., "Parallel combination of speech streams for improved ASR," Interspeech 2012, Portland, OR.

Anumanchipalli, G., Oliveira, L., and Black, A., "A Statistical Phrase/Accent Model for Intonation Modeling," Interspeech 2011, Florence, Italy.

Al-Haj, H., Hsiao, R., Lane, I., Black, A., and Waibel, A. "Pronunciation Modeling for Dialectal Arabic Speech Recognition" ASRU 2009, Merano, Italy.

Fadi Biadsy, Julia Hirschberg, "Using Prosody and Phonotactics in Arabic Dialect Identification," In Proceedings of Interspeech 2009, Brighton, UK.

Andrew Rosenberg, Julia Hirschberg, "Detecting Pitch Accents at the Word, Syllable, and Vowel Level," NAACL/HLT 2009, Boulder, CO.

Luciana Lucente, Julia Hirschberg and Plínio Barbosa, “Intonation, Discourse Structure and Information Status in Spontaneous Speech,” ETAP 2, Montreal. 2011

Agustín Gravano, Rivka Levitan, Laura Willson, Stefan Benus, Julia Hirschberg, and Ani Nenkova, “Acoustic and prosodic correlates of social behavior,” Interspeech 2011, Florence.

Sourish Chaudhuri, Bhiksha Raj. "Unsupervised Structure Discovery for Semantic Analysis of Audio", to appear in Neural Information Processing Systems (NIPS), 2012

Kenichi Kumatani, John McDonough, Bhiksha Raj. "Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-field Sensors", To appear in IEEE Signal Processing Magazine, 2012

Manas Pathak, José Portelo, Bhiksha Raj, Isabel Trancoso. "Privacy-preserving speaker authentication", Information Security Conference, Passau, 2012.

Kamal Sahni, Pranay Dighe, Rita Singh, Bhiksha Raj. "Language identification using spectro-temporal patch features". Proc. 5th ISCA workshop on statistical and perceptual audition (SAPA2012). 2012.

Gahgene Gweon, Mahaveer Jain, John McDonough, Carolyn Rosé, Bhiksha Raj "Predicting Idea Co-Construction in Speech Data using Insights from Sociolinguistics", International Conference of the Learning Sciences, 2012.

Kenichi Kumatani, John McDonough, Bhiksha Raj, "Maximum kurtosis beamforming with a subspace filter for distant speech recognition", Automatic Speech Recognition and Understanding (ASRU) 2011

Evandro Gouvea. "Hybrid speech recognition for voice search; a comparative study", Interspeech 2011. Florence, 2011

Kshitiz Kumar, Bhiksha Raj, Rita Singh, Richard Stern. "An iterative least-squares technique for dereverberation", IEEE International Conference on Acoustics Speech and Signal Processing. Prague, 2011

More coming soon, and feel free to add your own.

@@ Line 46: / Line 46: @@
 * [http://www1.cs.columbia.edu/~sbenus/Research/Gravano_et_al_Social_behavior_IS11.pdf Agustín Gravano, Rivka Levitan, Laura Willson, Stefan Benus, Julia Hirschberg, and Ani Nenkova, “Acoustic and prosodic correlates of social behavior,” Interspeech 2011, Florence.]
-*
+* [http://mlsp.cs.cmu.edu/publications/ Sourish Chaudhuri, Bhiksha Raj. "Unsupervised Structure Discovery for Semantic Analysis of Audio", to appear in Neural Information Processing Systems (NIPS), 2012 ]
-Coming soon...
+* [http://mlsp.cs.cmu.edu/publications/pdfs/spm.array.pdf Kenichi Kumatani, John McDonough, Bhiksha Raj. "Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-field Sensors", To appear in IEEE Signal Processing Magazine, 2012 ]
+* [http://mlsp.cs.cmu.edu/publications/pdfs/isc12.pdf  Manas Pathak, José Portelo, Bhiksha Raj, Isabel Trancoso. "Privacy-preserving speaker authentication", Information Security Conference, Passau, 2012. ]
+* [http://mlsp.cs.cmu.edu/publications/pdfs/sapa12.2.pdf  Kamal Sahni, Pranay Dighe, Rita Singh, Bhiksha Raj. "Language identification using spectro-temporal patch features". Proc. 5th ISCA workshop on statistical and perceptual audition (SAPA2012). 2012. ]
+* [http://mlsp.cs.cmu.edu/publications/pdfs/.pdf Gahgene Gweon, Mahaveer Jain, John McDonough, Carolyn Rosé, Bhiksha Raj "Predicting Idea Co-Construction in Speech Data using Insights from Sociolinguistics", International Conference of the Learning Sciences, 2012.]
+* [http://mlsp.cs.cmu.edu/publications/pdfs/Kumatani_ASRU2011.pdf Kenichi Kumatani, John McDonough, Bhiksha Raj, "Maximum kurtosis beamforming with a subspace filter for distant speech recognition", Automatic Speech Recognition and Understanding (ASRU) 2011]
+* [http://mlsp.cs.cmu.edu/publications/pdfs/evandro.intersp2011.pdf Evandro Gouvea. "Hybrid speech recognition for voice search; a comparative study", Interspeech 2011. Florence, 2011]
+* Kshitiz Kumar, Bhiksha Raj, Rita Singh, Richard Stern. "An iterative least-squares technique for dereverberation", IEEE International Conference on Acoustics Speech and Signal Processing. Prague, 2011
+More coming soon, and feel free to add your own.
