Difference between revisions of "Speech Reading Group"

From PublicWiki
Jump to: navigation, search
(Potential Papers)
(Potential Papers)
Line 46: Line 46:
 
* [http://www1.cs.columbia.edu/~sbenus/Research/Gravano_et_al_Social_behavior_IS11.pdf Agustın Gravano, Rivka Levitan, Laura Willson, Stefan Benus, Julia Hirschberg, and Ani Nenkova, “Acoustic and prosodic correlates of social behavior,” Interspeech 2011, Florence.]
 
* [http://www1.cs.columbia.edu/~sbenus/Research/Gravano_et_al_Social_behavior_IS11.pdf Agustın Gravano, Rivka Levitan, Laura Willson, Stefan Benus, Julia Hirschberg, and Ani Nenkova, “Acoustic and prosodic correlates of social behavior,” Interspeech 2011, Florence.]
  
*  
+
* [http://mlsp.cs.cmu.edu/publications/ Sourish Chaudhuri, Bhiksha Raj. "Unsupervised Structure Discovery for Semantic Analysis of Audio", to appear in Neural Information Processing Systems (NIPS), 2012 ]
Coming soon...
+
 
 +
* [http://mlsp.cs.cmu.edu/publications/pdfs/spm.array.pdf Kenichi Kumatani, John McDonough, Bhiksha Raj. "Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-field Sensors", To appear in IEEE Signal Processing Magazine, 2012 ]
 +
 
 +
* [http://mlsp.cs.cmu.edu/publications/pdfs/isc12.pdf  Manas Pathak, José Portelo, Bhiksha Raj, Isabel Trancoso. "Privacy-preserving speaker authentication", Information Security Conference, Passau, 2012. ]
 +
 
 +
* [http://mlsp.cs.cmu.edu/publications/pdfs/sapa12.2.pdf  Kamal Sahni, Pranay Dighe, Rita Singh, Bhiksha Raj. "Language identification using spectro-temporal patch features". Proc. 5th ISCA workshop on statistical and perceptual audition (SAPA2012). 2012. ]
 +
 
 +
* [http://mlsp.cs.cmu.edu/publications/pdfs/.pdf Gahgene Gweon, Mahaveer Jain, John McDonough, Carolyn Rosé, Bhiksha Raj "Predicting Idea Co-Construction in Speech Data using Insights from Sociolinguistics", International Conference of the Learning Sciences, 2012.]
 +
 
 +
* [http://mlsp.cs.cmu.edu/publications/pdfs/Kumatani_ASRU2011.pdf Kenichi Kumatani, John McDonough, Bhiksha Raj, "Maximum kurtosis beamforming with a subspace filter for distant speech recognition", Automatic Speech Recognition and Understanding (ASRU) 2011]
 +
 
 +
* [http://mlsp.cs.cmu.edu/publications/pdfs/evandro.intersp2011.pdf Evandro Gouvea. "Hybrid speech recognition for voice search; a comparative studey", Interspeech 2011. Florence, 2011]
 +
 
 +
* Kshitiz Kumar, Bhiksha Raj, Rita Singh, Richard Stern. "An iterative least-squares technique for dereverberation", IEEE International Conference on Acoustics Speech and Signal Processing. Prague, 2011
 +
 
 +
 
 +
More coming soon, and feel free to add your own.

Revision as of 19:25, 9 January 2013

This is the wiki page for topics to be discussed in the Speech Reading Group, starting Winter 2013.

The group is meant to offer an opportunity for people working on or interested in speech technology to interact and share ideas, with the goals of fostering a larger sense of community amongst speech researchers at UW, and allowing participants to become familiar with the breadth of research going on here and in the speech community at large.


We will cover any topics with some relation to speech research, including but not limited to:

  • Speech Recognition
  • Speech Production
  • Feature Extraction
  • Speech Enhancement/Separation
  • Auditory scene analysis
  • Speech Perception
  • Cognitive Psychology of Speech

Administrative Meeting

There will be an initial administrative meeting as follows:

When: Thursday, January 10, 4:00 pm

Where: CSE 303

This meeting will be to gauge interest and size of the group, as well as to discuss some administrative matters, such as organization of the group, meeting times, location, and paper schedule.


For questions, or other, please contact either:

  • Gabe Schubiner - gabeos [at] cs.wash....edu
  • Scott Wisdom - swisdom [at] uw.edu

Potential Papers

  • Kshitiz Kumar, Bhiksha Raj, Rita Singh, Richard Stern. "An iterative least-squares technique for dereverberation", IEEE International Conference on Acoustics Speech and Signal Processing. Prague, 2011


More coming soon, and feel free to add your own.