|
Seminar Audio Processing and Indexing ( Will be updated.
Last updated: October 8th
2024. ) |
||||||||||||||||||||||||||||||||||
Contents
|
Period: Fall September 10th - December 17th 2023 Time: Tuesday 13.15 - 15.00 Place: online using
Kaltura, or BW 0.39 (Gorlaeus Building) Please see
announcements.
Lecturer: Dr Erwin M. Bakker ( erwin@liacs.nl ) LIACS Media Lab (LML)
Teaching assistant: to be announced
NB Register on BrightSpace.
During this seminar the fundamentals of audio processing and indexing will be studied. Applications in the area of speech recognition, audio synthesis and content based audio retrieval will be discussed. State of the art work on content based audio retrieval will be studied and presented by the participants. The seminar starts with several lectures and accompanying assignments in the form of workshops; followed by a literature selection, study, and presentations by all the students; the seminar ends with final project demos / presentations. Requirements: C, C++ Grading (6 ECTS): Presentations and Project (60% of grade). Class discussions, attendance, and workshops (40% of grade). It is necessary to be at every class and to complete every workshop. If you can not be there, you must contact Dr. E.M. Bakker before class!
Lecture slides and further materials will be made available on this site.
Discrete-Time Speech Signal Processing, Principles and Practice by T.F. Quatieri, Prentice Hall PTR; ISBN 013242942, 2002.
Fundamentals of Speech Recognition by Lawrence Rabiner, and Biing-Hwang Juang (Hardcover, 507 pages; Publisher: Pearson Education POD; ISBN: 0130151572; 1st edition, April 12, 1993)
Spoken Language Processing: A Guide to Theory, Algorithm and System Development by Xuedong Huang , Alex Acero , Hsiao-Wuen Hon , Raj Reddy (Hardcover, 980 pages; Publisher: Prentice Hall PTR; ISBN: 0130226165; 1st edition, April 25, 2001)
Speech Recognition: Theory and C++ Implementation by Claudio Bechetti and Lucio Prina Ricotti (Hardcover, 407 pages; Publisher: John Wiley & Sons; ISBN: 0471977306; 1st edition April, 1999)
You have access to all of the
listed journals by using your ULCN-
|
Schedule (tentative, visit
regularly):
Assignments (Workshops@Home):
Project Links Fall 2023
Student Paper-Presentations
Session (examples from previous years)
Previous
Project Titles and Pages
2021
During the seminar we will study state-of-the-art audio indexing methods and techniques using recent scientific publications from international journals, workshops, and conferences on content based audio retrieval. Each student will present a recent technical paper:
During the seminar each student has to do a project related to audio processing/synthesis/indexing. The agenda for the projects is as follows:
Every student has to maintain a project web page or github on which progress, documents, code, links, etc. related to the project are maintained. Here you can find an example project page. Feel free to design your own project web page though. Do not forget to mail me the link to your project page. Note: Using your university account you can put your web-page under a directory \home\public_html. |