Materials: Multimodal User Interfaces 2008

Lecture Slides
| 14.04.2008 | Introduction [pdf] |
| 21.04.2008 | Automatic Speech Recognition I [pdf] |
| 28.04.2008 | Automatic Speech Recognition II [pdf] |
| 05.05.2008 | Visual Perception I [pdf] |
| 19.05.2008 | Visual Perception II [pdf] |
| 26.05.2008 | Audio-Visual Speech Recognition [pdf] |
| 02.06.2008 | Handwriting Recognition [pdf] |
| 09.06.2008 | Multimodal Fusion [pdf] |
| 16.06.2008 | Multimodal Systems [pdf] |
| 23.06.2008 | Multimodal Dialogue [pdf] |
| 30.06.2008 | User Studies & Evaluations [pdf] |
| 07.07.2008 | Intelligent Environments [pdf] |
| 14.07.2008 | Review/Summary [pdf] |

Supplementary Papers/Sources
Introduction
- A. Waibel, M. T. Vo, P. Duchnowski, S. Manke, Multimodal Interfaces, Artificial Intelligence Review, Vol. 10, pp. 299-319, 1995. [ps.gz]
- R. Sharma, V. Pavlovic, T. Huang, Toward Multimodal Human-Computer Interface, Proceedings of the IEEE, Vol. 86, No. 5, May 1998. [pdf]
Face Detection
- S. L. Phung, A. Bouzerdoum and D. Chai, Skin Segmentation Using Color Pixel Classification: Analysis and Comparison, IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 27, No. 1, January 2005. [pdf]
- H. A. Rowley, S. Baluja and T. Kanade, Neural Network-Based Face Detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 20, No. 1, pp. 23-38, January 1998. [ps]
- Paul Viola and Michael Jones, Rapid Object Detection using a Boosted Cascade of Simple Features, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - CVPR 2001, pp. 511-518, 2001. [pdf]
Head Orientation & Attention
- Model-based:
  - Andrew H. Gee and Roberto Cipolla, Non-Intrusive Gaze Tracking for Human-Computer Interaction, Proc. Mechatronics and Machine Vision in Practice, 1994. [ps]
  - R. Stiefelhagen, J. Yang and A. Waibel, A Model-Based Gaze Tracking System, International Journal of Artificial Intelligence Tools, Vol. 6 (2), pp. 193-209, 1997. [ps.gz]
- Neural networks:
  - R. Stiefelhagen, J. Yang and A. Waibel, Simultaneous Tracking of Head Poses in a Panoramic View, International Conference on Pattern Recognition - ICPR 2000, pp. 726-729, Barcelona, 2000. [pdf]
- Modeling focus of attention:
  - R. Stiefelhagen, Tracking Focus of Attention in Meetings, IEEE International Conference on Multimodal Interfaces - ICMI 2002, pp. 273-280, Pittsburgh, 2002. [pdf]
Person Tracking
- C. Wren, A. Azarbayejani, T. Darrell, A. Pentland: Pfinder: Real-Time Tracking of the Human Body. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 19, No. 7, pp. 780-785, July 1997. [pdf]
- Mun Wai Lee, Isaac Cohen and Soon Ki Jung: Particle Filter with Analytical Inference for Human Body Tracking. Institute for Robotics and Intelligent Systems, Integrated Media Systems Center, University of Southern California, 2002. [pdf]
- M. Isard and A. Blake, Condensation - Conditional Density Propagation for Visual Tracking, International Journal of Computer Vision 29(1), pp. 5-28, 1998. [pdf]
- D. Focken, R. Stiefelhagen: Towards Vision-Based 3-D People Tracking in a Smart Room. IEEE International Conference on Multimodal Interfaces, Pittsburgh, PA, USA, October 14-16, 2002, pp. 400-405. [pdf]
- K. Nickel, R. Stiefelhagen: 3D-Tracking of Heads and Hands for Pointing Gesture Recognition in a Human-Robot Interaction Scenario, Sixth Int. Conf. on Face and Gesture Recognition, May 2004, Seoul, Korea. [pdf]
Gesture Recognition
- V. I. Pavlovic, R. Sharma, T. S. Huang: Visual Interpretation of Hand Gestures for Human-Computer Interaction: A Review. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997. [pdf]
- L. R. Rabiner: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proc. IEEE, 77 (2), pp. 257-286, 1989. [pdf]
- D. A. Becker: Sensei: A Real-Time Recognition, Feedback and Training System for Tai Chi Gestures. M.I.T. Media Lab Perceptual Computing Group Technical Report No. 426, 1997. [pdf]
- T. Starner, J. Weaver, A. Pentland: Real-Time American Sign Language Recognition Using Desk and Wearable Computer Based Video. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(12), pp. 1371-1375, 1998. [pdf]
- J. Cassell: A Framework for Gesture Generation and Interpretation. In R. Cipolla and A. Pentland (eds.), Computer Vision in Human-Machine Interaction, pp. 191-215. New York: Cambridge University Press, 1998. [pdf]
- I. Poddar, Y. Sethi, E. Ozyildiz and R. Sharma: Toward Natural Gesture/Speech HCI: A Case Study of Weather Narration. Proc. Workshops on Perceptual User Interfaces, pp. 1-6, November 1998. [pdf]
Speech Recognition
- Lecture: Automatische Spracherkennung (Automatic Speech Recognition) [html]
- Schukat-Talamazzini: Automatische Spracherkennung [ps]
- E. G. Schukat-Talamazzini: Automatische Spracherkennung. Vieweg, Braunschweig/Wiesbaden, 1995. [online]
- Alex Waibel and Kai-Fu Lee (editors): Readings in Speech Recognition. Morgan Kaufmann Publishers, Inc., San Mateo, CA, 1990.
- F. Jelinek: Statistical Methods for Speech Recognition.
Audio-Visual Speech Recognition
- Uwe Meier, Rainer Stiefelhagen, Jie Yang, Alex Waibel: Towards Unrestricted Lipreading. ICMI 1999, Hong Kong. [ps]
- U. Meier, W. Hürst, P. Duchnowski: Adaptive Bimodal Sensor Fusion for Automatic Speechreading. ICASSP 1996, Atlanta, Georgia. [ps]
- C. Neti, G. Potamianos, J. Luettin, I. Matthews, H. Glotin, D. Vergyri, J. Sison, A. Mashari, and J. Zhou: Audio-Visual Speech Recognition. Final Workshop 2000 Report, Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD (Oct. 12, 2000). [pdf]
Handwriting Recognition
- R. Plamondon, S. N. Srihari, On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 1, January 2000. [pdf]
- C. C. Tappert, C. Y. Suen, T. Wakahara: The State of the Art in Online Handwriting Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 12, No. 8, pp. 787-808, Aug. 1990. [pdf]
- S. Jäger, S. Manke, A. Waibel: NPEN++: An Online Handwriting Recognition System. 7th International Workshop on Frontiers in Handwriting Recognition, Amsterdam, 2000. [pdf]
- S. Manke, M. Finke, A. Waibel: NPen++: a writer independent, large vocabulary on-line cursive handwriting. Proceedings of the Third International Conference on Document Analysis and Recognition, Vol. 1, pp. 403-408, 14-16 Aug. 1995. [pdf]
- A. Waibel et al.: Phoneme Recognition Using Time-Delay Neural Networks, IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. 37, No. 3, March 1989. [pdf]
Multimodal Fusion, Integration and Systems
- Kittler et al., On Combining Classifiers, IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 20(3), March 1998. [pdf]
- R. Sharma, V. Pavlovic, T. Huang, Toward Multimodal Human-Computer Interface, Proceedings of the IEEE, Vol. 86, No. 5, May 1998. [pdf]
- A. Waibel, M. T. Vo, P. Duchnowski, S. Manke, Multimodal Interfaces, Artificial Intelligence Review, Vol. 10, pp. 299-319, 1995. [ps.gz]
- Oviatt, DeAngeli and Kuhn, Integration and Synchronization of Input Modes during Multimodal Human-Computer Interaction, CHI, pp. 415-422, 1997. [pdf]
- P. R. Cohen, M. Johnston, D. R. McGee, S. L. Oviatt, J. Pittman, I. Smith, L. Chen, and J. Clow, QuickSet: Multimodal Interaction for Distributed Applications, Proceedings of the Fifth International Multimedia Conference (Multimedia '97), ACM Press, Seattle, WA, November, pp. 31-40. [pdf]
- Holzapfel et al., Implementation and Evaluation of a Constraint-Based Multimodal Fusion System for Speech and 3D Pointing Gestures, Proceedings of the International Conference on Multimodal Interfaces (ICMI), State College, 2004. [pdf]
- M. Johnston, Unification-Based Multimodal Parsing, Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics, Volume 1, pp. 624-630, 1998. [pdf]
- R. Stiefelhagen, C. Fuegen, P. Gieselmann, H. Holzapfel, K. Nickel, A. Waibel, Natural Human-Robot Interaction Using Speech, Gaze and Gestures, IEEE/RSJ International Conference on Intelligent Robots and Systems, Sept. 2004, Sendai, Japan. [pdf]