Multimodale Benutzerschnittstellen
Multimodale Benutzerschnittstellen > Materialien

Materialien

Multimodale Benutzerschnittstellen 2008

Folien zur Vorlesung


14.04.2008Einführung [pdf]
21.04.2008Automatic Speech Recognition I [pdf]
28.04.2008Automatic Speech Recognition II [pdf]
05.05.2008 Visuelle Perzeption I [pdf]
19.05.2008

Visuelle Perzeption II [pdf]

26.05.2008 Audio-visuelle Spracherkennung [pdf]
02.06.2008 Handwriting Recognition [pdf]
09.06.2008 Multimodale Fusion [pdf]
16.06.2008Multimodale Systeme [pdf]
23.06.2008Multimodaler Dialog [pdf]
30.06.2008User Studies & Evaluations [pdf]
07.07.2008Intelligent Environments [pdf]
14.07.2008Wiederholung/Zusammenfassung [pdf]

Ergänzende Papers/Quellen

Einführung
  • A. Waibel, M. T. Vo, P. Duchnowski, S. Manke, Multimodal Interfaces, Artificial Intelligence Review, Vol. 10, pp.299-319, 1995. [ps.gz]
  • R. Sharma, V. Pavlovic, T. Huang, Toward Multimodal Human-Computer Interface, Proceedings of the IEEE, Vol. 86, No. 5, May 1998. [pdf]
Gesichtsdetektion
  • S. L. Phung, A. Bouzerdoum and D. Chai, Skin Segmentation using color pixel classification: Analysis and Comparision, IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 27, No. 1, January 2005. [pdf]
  • Neural Network Based Face Detection, by Henry A. Rowley, Shumeet Baluja, and Takeo Kanade. IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 20, number 1, pages 23-38, January 1998. [ps]
  • Paul Viola and Michael Jones, Rapid Object Detection using a Boosted Cascade of Simple Features, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - CVPR 2001, pp. 511-518, 2001. [pdf]
Kopfdrehung & Aufmerksamkeit
  • Model-based:
    • Andrew H. Gee and Roberto Cipolla, Non-Intrusive Gaze Tracking for Human-Computer Interaction, Proc. Mechatronics and Machine Vision in Practise, 1994. [ps]
    • R. Stiefelhagen, J. Yang and A. Waibel, A Model-Based Gaze Tracking System, International Journal of Artificial Intelligence Tools, Vol.6 (2), pp. 193-209, 1997. [ps.gz]
  • Neural-networks:
    • R. Stiefelhagen, J. Yang and A. Waibel,Simultaneous Tracking of Head Poses in a Panoramic View, International Conference on Pattern Recognition - ICPR 2000, pp. 726-729, Barcelona, 2000. [pdf]
  • Modeling Focus of Attention:
    • R. Stiefelhagen, Tracking Focus of Attention in Meetings, IEEE International Conference on Multimodal Interfaces - ICMI 2002, pp. 273-280, Pittsburgh, 2002. [pdf]
Personentracking
  • C. Wren, A. Azerbeidschani, T. Darrell, A. Pentland: Pfinder: Real-Time Tracking of the Human Body. IEEE Transactions on Pattern Analysis an Machine Intelligence, July 1997, vol 19, no 7, pp. 780-785. [pdf]
  • Mun Wai Lee, Isaac Cohen and Soon Ki Jung: Particle Filter with Analytical Inference for Human Body Tracking. Institute for Robotics and Intelligent Systems, Integrated Media Systems Center, University of South California, 2002. [pdf]
  • M. Isard and A. Blake, Condensation conditional density propagation for visual tracking, International Journal of Computer Vision 29(1), pp. 528, 1998. [pdf]
  • D. Focken, R. Stiefelhagen. Towards Vision-based 3-D People Tracking in a Smart Room. IEEE International Conference on Multimodal Interfaces, Pittsburgh, PA, USA, October 14-16, 2002, pp. 400-405. [pdf]
  • Nickel, K., Stiefelhagen, R.: 3D-Tracking of Heads and Hands for Pointing Gesture Recognition in a Human-Robot Interaction Scenario, Sixth Int. Conf. On Face and Gesture Recognition, May 2004, Seoul, Korea. [pdf]
Gestenerkennung
  • V.I. Pavlovic, R. Sharma, T.S. Huang: Visual Interpretation of Hand Gestures for Human-Computer Interaction: A Review. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997. [pdf]
  • Rabiner, L.R.: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proc. IEEE, 77 (2), 257286, 1989. [pdf]
  • Becker, D.A.: Sensei: A Real-Time Recognition, Feedback and Training System for Tai Chi Gestures. M.I.T. Media Lab Perceptual Computing Group Technical Report No. 426, 1997. [pdf]
  • T. Starner, J. Weaver, A. Pentland: Real-Time American Sign Language Recognition Using Desk and Wearable Computer Based Video. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(12):1371--1375, 1998. [pdf]
  • Cassell, J.: A Framework For Gesture Generation And Interpretation. In Cipolla, R. and Pentland, A. (eds.), Computer Vision in Human-Machine Interaction, pp. 191-215. New York: Cambridge University Press. 1998. [pdf]
  • Poddar, I., Sethi, Y., Ozyildiz, E. and Sharma, R.: Toward Natural Gesture/Speech HCI: A Case Studyof Weather Narration. Proc. Workshops onPerceptual User Interfaces, pages 1-6, November, 1998. [pdf]
Spracherkennung
  • Vorlesung Automatische Spracherkennung [html]
  • Schukat-Talamazzini: Automatische Spracherkennung [ps]
  • E. G. Schukat-Talamazzini: Automatische Spracherkennung: Vieweg: Braunschweig/Wiesbaden 1995 [online]
  • Alex Waibel and Kai-Fu Lee (editors): Readings in Speech Recognition: Morgan Kaufmann Publishers, Inc.: San Mateo, CA 1990
  • F. Jelinek: Statistical Methods of Speech Recognition
Audio-Visuelle Spracherkennung
  • Uwe Meier, Rainer Stiefelhagen, Jie Yang, Alex Waibel: Towards Unrestricted Lipreading. ICMI 1999, Hong Kong. [ps]
  • U. Meier, W. Hrst, P. Duchnowski: Adaptive Bimodal Sensor Fusion For Automatic Speech Reading. ICASSP 1996, Atlanta, Georgia. [ps]
  • C. Neti, G. Potamianos, J. Luettin, I. Matthews, H. Glotin, D. Vergyri, J. Sison, A.Mashari, and J. Zhou: Audio-Visual Speech Recognition. Final Workshop 2000 Report, Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD (Oct. 12, 2000). [pdf]
Handschriftenerkennung
  • R. Plamondon, S. N. Srihari, On-Line and Off-line Handwriting Recognition: A Comprehensive Survey, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 1, January 2000 [pdf]
  • Tappert, C.C., Suen, C.Y., Wakahara, T.: The state of the art in online handwriting recognition. Pattern Analysis and Machine Intelligence, IEEE Transactions on Volume 12, Issue 8, Aug. 1990, Page(s):787 - 808 [pdf]
  • S. Jäger, S. Manke, A. Waibel: NPEN++: An online Handwriting Recognition System. 7th International Workshop on Frontiers in Handwriting Recognition, Amsterdam, 2000 [pdf]
  • Manke S., Finke M., Waibel A.: NPen++: a writer independent, large vocabulary on-line cursive handwriting. Proceedings of the Third International Conference on Document Analysis and Recognition, 1995, Volume 1, 14-16 Aug. 1995 Page(s):403 - 408 vol.1 [pdf]
  • A. Waibel et al.: Phoneme Recognition Using Time-Delay Neural Networks, IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. 31. No. 3. March 1989. [pdf]
Multimodal Fusion, Integration and Systems
  • Kittler et al., On Combining Classifiers, IEEE Trans. On Pattern Analysis
    and Machine Intelligence, Vol. 20(3), March 1998. [pdf]
  • R. Sharma, V. Pavlovic, T. Huang, Toward Multimodal Human-Computer Interface, Proceedings of the IEEE, Vol. 86, No. 5, May 1998. [pdf]
  • A. Waibel, M. T. Vo, P. Duchnowski, S. Manke, Multimodal Interfaces, Artificial Intelligence Review, Vol. 10, pp.299-319, 1995. [ps.gz]
  • Oviatt, DeAngeli and Kuhn, Integration and synchronization of input modes during multimodal human-computer interaction, CHI, pp. 415-422, 1997. [pdf]
  • Cohen, P.R., Johnston, M., McGee, D.R., Oviatt, S.L., Pittman, J., Smith, I., Chen, L., and Clow, J., QuickSet: Multimodal interaction for distributed applications, in the Proceedings of the Fifth International Multimedia Conference (Multimedia '97), ACM Press: Seattle, WA, November, pp. 31-40. [pdf]
  • Holzapfel et al., Implementation and Evaluation of a ConstraintBased Multimodal Fusion System for Speech and 3D Pointing Gestures, Proceedings of the International Conference on Multimodal Interfaces, (ICMI), State College, 2004. [pdf]
  • Johnston, M., Unification-based multimodal parsing, Proceedings of the 36th annual meeting on Association for Computational Linguistics - Volume 1, pp. 624-630, 1998. [pdf]
  • R. Stiefelhagen, C. Fuegen, P. Gieselmann, H. Holzapfel, K. Nickel, A. Waibel, Natural Human-Robot Interaction using Speech, Gaze and Gestures, IEEE/RSJ International Conference on Intelligent Robots and Systems, Sept.2004, Sendai, Japan. [pdf]