IEEE SPS SLTC E-Newsletter

Welcome to the July 2006 issue of the IEEE Signal Processing Society Speech and Language Technical Committee (SLTC) e-Newsletter. It has been an exciting spring, as we have just completed another successful ICASSP conference in Toulouse, France. This issue includes articles about the latest news and upcoming events in the spoken language research community, as well as a letter to all IEEE members regarding the upcoming elections for the Signal Processing Society's Board of Governors.

We would like to thank Isabel Trancoso and Dilek Hakkani-Tur for their contributions to this issue of the newsletter.

As always, we welcome your contributions of news, events, publications, workshops, and career information to the newsletter. Please send all articles, ideas, and feedback to the SLTC e-Newsletter Editorial Board [speechnewseds <at> ieee <dot> org]. Archives of recent SLTC e-Newsletters can be found on the SLTC website. Instructions to subscribe or unsubscribe to the eNewsletter distribution can be found here.

Have a great summer!

The SLTC e-Newsletter Editorial Board
Mike Seltzer, Brian Mak, and Gokhan Tur
[speechnewseds <at> ieee <dot> org]

Highlights from Speech and Language Technical Committee Meeting at ICASSP 2006
by Brian Mak

The Speech and Language Technical Committee (SLTC) of the IEEE Signal Processing Society met on May 18, 2006 during ICASSP 2006 in Toulouse, France. Below is a summary of some major topics that were discussed.

Please Vote in the Upcoming Elections for Member-at-Large of IEEE Signal Processing Society's Board of Governors
by Isabel Trancoso

Every Summer all members of the IEEE Signal Processing Society are asked to elect members-at-large for the Board of Governors (BoG). Unfortunately, most members don't even bother to vote, either because they don't understand the role of the BoG, or they don't know the candidates.

So the purpose of this message is to try to explain how important it will be for all the people working in spoken language processing to vote.

Many of the decisions the IEEE Signal Processing Society makes are passed for a vote in the BoG, so if you vote for somebody working in your field, you increase the chances of your field being better represented in the Society. That is visible for instance in terms of major Awards, changes in the society publications (page budgets of the Transactions, electronic pre-publication on Xplore, etc.), structure of conferences (special sessions and tutorials devoted to spoken language processing), number of fellows, structure of technical committees and councils, etc.

Spoken language processing has a very big weight in the society in terms of the number of papers presented at ICASSP (20 to 25%, typically), and number of members (possibly three to four thousand). But this weight is not at all reflected in the committees where there is one representative per area. Here the Speech and Language Technical Committee is only ONE OUT OF 12. And it is not at all reflected in the current members of the BoG.

So this message is an incentive for the speech and language community to be more active in the society and start by voting for representatives of their field in the coming BoG elections.

This time our candidate is Mazin Gilbert (formerly Rahim). Mazin has done such a terrific work as Chair of the Speech and Language Technical Committee that he does not need any introduction from me. He has been one of the strongest supporters of increasing the weight and visibility of the speech and language area within IEEE, he has done a tremendous work conducting the ICASSP review process in the last couple of years, promoting special sessions, and tutorials, setting up new workshops and many new activities within the STC.

I'm sure he will be the perfect representative of the speech and language community within the BoG. So don't throw your ballot away!

Isabel Trancoso
Member-at-Large
Board of Governors
IEEE Signal Processing Society

back to top

IEEE SPS Awards Announced for Speech and Language Researchers
by Michael L. Seltzer

AT&T, ICSI, and Edinburgh University Announce DiSCoH (Spoken Dialogue System for Conference Help)
by Dilek Hakkani-Tur

DiSCoH (Spoken Dialogue System for Conference Help) is a goal-oriented, mixed-initiative, human-machine spoken dialogue system for conference information, developed by researchers at AT&T Labs, ICSI, and Edinburgh University. The main goal of of DiSCoH is to collect naturally spoken human-machine dialogs. All of these dialogues will be logged by DiSCoH using rich representations of context and all data collected with the system will be released to the research community. It is first being deployed for the IEEE SLT Workshop 2006, see http://www.slt06.org

Soon you will be able to call +1 888 687 3887 (toll free for US callers, please check the project web page for the availability date of the system) to access DiSCoH for information about the conference including locations, dates, paper submission deadlines, program, venue, paper status, accommodation options and costs, etc.

The central motivation for this system is that the lack of large, richly annotated dialogue corpora from real spoken dialogue applications is a major barrier to progress in the research community. We believe that such a corpus, for example annotated with speech acts, user utterance transcriptions, user intentions, overall task success, etc., will be an essential resource for research in dialogue management, spoken language understanding, automatic speech recognition, and related tasks.

NIST has announced a new evaluation initiative in Spoken Term Detection (STD). This evaluation is designed to facilitate research in the area of information retrieval from speech data. Initially, the evaluation will focus on finding short word sequences rapidly and accurately across large heterogeneous collections of audio. Specifically, the STD task will be to create an index from a speech corpus and then find all occurrences, if any, of specified terms in that corpus. The search terms used for evaluation will not be known in advance. The test corpus will include data from broadcast news, telephone conversations, and meetings, and will be in English, Chinese, and Arabic. System performance will be measured based on accuracy, speed, and robustness. The schedule for the STD evaluation is as follows:

A detailed description of the evaluation plan is located on the STD website http://www.nist.gov/speech/tests/std/. Those interested in participating in the evaluation or interested in this research area can join the STD interest group mailing list by sending email to STD-info@nist.gov.

How to Subscribe to or Unsubscribe from the SLTC e-Newsletter
by Michael L. Seltzer

We have created a listserv hosted by IEEE for the distribution of the SLTC e-Newsletter, [ speechnewsdist <at> listserv <dot> ieee <dot> org ]. This list is intended for the purpose of disseminating news and information pertaining to the IEEE SPS Speech and Language Technical Committee (SLTC), and in particular for distributing the electronic newsletter of the SLTC. To receive the newsletter and any other related announcements, you can simply subscribe to the distribution list (tell your friends and colleagues!). To no longer receive the newsletter, you can unsubscribe from the list. Note that if you have received this issue (June 2006) by email, you are currently already subscribed to the list. If you wish to remain subscribed, no action needs to be taken.

To Subscribe:
Send an email with the command "subscribe speechnewsdist" in the message body to [ listserv <at> listserv <dot> ieee <dot> org ].

To Unsubscribe:
Send an email with the command "signoff speechnewsdist" in the message body to [ listserv <at> listserv <dot> ieee <dot> org ].

Note that subscribers cannot post to this distribution list. Please send all contributions, articles, ideas, and feedback to the SLTC e-Newsletter Editorial Board [ speechnewseds <at> ieee <dot> org ].

The first workshop on Spoken Language Technology (SLT) sponsored by IEEE and ACL will be held on December 10-December 13, 2006. The goal
of this workshop is to bring the speech processing and natural language processing communities together to share and present recent
advances in the area of spoken language technology, and to discuss and foster new research in this area. Spoken language technology is a
vibrant research area, with the potential for significant impact on government and industrial applications.

Submissions for the Technical Program

The workshop program will consist of tutorials, oral and poster presentations, and panel discussions. Attendance will be limited with
priority for those who will present technical papers; registration is required of at least one author for each paper. Submissions are
encouraged on any of the topics listed above. The style guide, templates, and submission form will follow the IEEE ICASSP
style. Three members of the Scientific Committee will review each paper. The workshop proceedings will be published on a CD-ROM.

Registration and Information

Registration and paper submission, as well as other workshop information, can be found on the SLT website: http://www.slt2006.org

In recent years, there has been a significant progress on software and hardware techniques for humanmachine interaction, specially, for speech-based one. While highly successful in the core and many commercial applications have resulted from the progress, the techniques have a long way to improve, specially in areas of speech signal processing, speech/audio source separation, robustness of speech systems, speech modeling, learning algorithms, real-time decoding algorithms, speech quality enhancement, speech synthesis, speaker identification, and so on, for further impacting on speech-centric human-machine interaction. This special issue intends to bring such techniques among the researchers in the field and promote new methods that may prove useful for further research in this area.

Topics: Papers showing mature results of research, advancing the state-of-the-art in terms of systems, algorithms or architectures for speech-based human machine interaction, are expected, particularly involving:

Submission procedure: Please, submit your paper to Manuscript Central at http://cs-ieee.manuscriptcentral.com/, selecting this special issue's title. Submitted articles must not have been previously published or currently submitted for journal publication elsewhere. As an author, you are responsible for understanding and adhering to our submission guidelines. You can access them by clicking on http://www.computer.org/mc/tc/author.htm. Please thoroughly read these before submitting your manuscript. Feel free to contact the Peer Review Manager, Suzanne Werner at swerner@computer.org or the guest editors at guido@ifsc.usp.br , deng@microsoft.com or maki@cslab.kecl.ntt.co.jp , if you have any questions.

INTERSPEECH 2006 - ICSLP, the Ninth International Conference on Spoken Language Processing dedicated to the interdisciplinary study of speech science and language technology, will be held in Pittsburgh, Pennsylvania, September 17-21, 2006, under the sponsorship of the International Speech Communication Association (ISCA).

The Interspeech meetings are considered to be the top international conference in speech and language technology, with more than 1000 attendees from universities, industry, and government agencies. They are unique in that they bring together faculty and students from universities with researchers and developers from government and industry to discuss the latest research advances, technological innovations, and products. The conference offers the prospect of meeting the future leaders of our field, exchanging ideas, and exploring opportunities for collaboration, employment, and sales through keynote talks, tutorials, technical sessions, exhibits, and poster sessions. In recent years the Interspeech meetings have taken place in a number of exciting venues including most recently Lisbon, Jeju Island (Korea), Geneva, Denver, Aalborg (Denmark), and Beijing.

In addition to the regular sessions, a series of special sessions has been planned for the meeting. Potential authors are invited to submit papers for special sessions as well as for regular sessions, and all papers in special sessions will undergo the same review process as papers in regular sessions. Confirmed special sessions and their organizers include:

Organizer:
Professor Richard M. Stern (General Chair)
Carnegie Mellon University
Electrical Engineering and Computer Science
5000 Forbes Avenue
Pittsburgh, PA 15213-3890
Fax: +1 412 268-3890
email: chair@interspeech2006.org

Call for Participation:
IEEE International workshop on multimedia signal processing
Victoria, BC, Canada
October 3-6, 2006

MMSP-06 is the eighth international workshop on multimedia signal processing organized by the Multimedia Signal Processing Technical Committee of the IEEE Signal Processing Society. The MMSP-06 workshop features several new components that include:

A Student Paper Contest with awards sponsored by Microsoft Research. To enter the contest a paper submission must have a student as the first author
Overview talks that present the state-of-the-art in methods and applications for selected topics of interest in multimedia signal processing
Wrap-up presentations that summarize the main contributions of the papers accepted at the workshop, hot topics and current trends in multimedia signal processing
New content requirements for the submitted papers
New review guidelines for the submitted papers

SCOPE

Papers are solicited for, but not limited to, the general areas:

Multimedia Processing (modalities: audio, speech, visual, graphics, other; processing: pre- and post- processing of multimodal data, joint audio/visual and multimodal processing, joint source/channel coding, 2-D and 3-D graphics/geometry coding and animation, multimedia streaming)
Multimedia Databases (content analysis, representation, indexing, recognition, and retrieval)
Multimedia Security (data hiding, authentication, and access control)
Multimedia Networking (priority-based QoS control and scheduling, traffic engineering, soft IP multicast support, home networking technologies, wireless technologies)
Multimedia Systems Design, Implementation and Applications (design: distributed multimedia systems, real-time and non real-time systems; implementation: multimedia hardware and software; applications: entertainment and games, IP video/web conferencing, wireless web, wireless video phone, distance learning over the Internet, telemedicine over the Internet, distributed virtual reality)
Human-Machine Interfaces and Interaction using multiple modalities
Human Perception (including integration of art and technology)
Standards

SCHEDULE

Special Sessions (contact the respective chair by): March 8, 2006 (Call for Special Sessions)
Papers (full paper, 4 pages, to be received by): April 8, 2006 (Instructions for Authors)
Notification of acceptance by: June 8, 2006
Camera-ready paper submission by: July 8, 2006 (Instructions for Authors)

Check the workshop website http://research.microsoft.com/workshops/MMSP06 for updates.

Position Available:

RESEARCHER IN MACHINE TRANSLATION
SRI INTERNATIONAL SPEECH TECHNOLOGY AND RESEARCH LABORATORY

The Speech Technology and Research (STAR) Laboratory at SRI International seeks a self-motivated, team-oriented researcher in machine translation. Highly qualified postdoctoral fellows may also apply.

The STAR Laboratory is engaged in leading-edge research in speech recognition, automatic spoken language translation, speaker recognition and verification, human-machine interfaces, and other areas of speech/language technology, and offers opportunities for basic research as well as prototyping and collaborative productization. For further details about the SRI STAR Lab please see http://www.speech.sri.com.

The successful candidate will have the opportunity to work on multiple government-funded research projects and to collaborate with other researchers at SRI and partner institutions. S/he will work on high-performance, deployable machine translation systems for multiple language pairs with varying levels of resources. A PhD with machine translation background is desired. The candidate must have strong engineering capability, with skills in C/C++ and scripting languages in a Unix/Linux environment. Strong oral and written communication skills are expected. Experience in previous NIST MT evaluations and knowledge of speech recognition are highly desirable.

Candidates must be able to work both independently and cooperatively across multiple projects with dynamically forming teams. Characteristics of STAR staff are enthusiasm, self-motivation, initiative, and passion for learning. Please apply online via https://sri.ats.hrsmart.com/cgi-bin/a/highlightjob.cgi?jobid=3027

Position Available:

Senior Researcher position in Speech Processing
at the IDIAP Research Institute, Switzerland

The IDIAP Research Institute (www.idiap.ch) is currently seeking one talented senior researcher in speech processing and/or machine learning with a proven record of high level research and project management, and whose interests are aligned with our existing strengths in speech processing (speech and speaker recognition, language understanding), or in related areas such as computer vision, biometric authentication, and multimedia data mining. A candidate with these strengths will be expected to play a leading role in the research, teaching and strategic development of the Institute.

Most of IDIAP's research activities take place in the framework of National long term research initiatives such as the National Centre of Competence in Research (NCCR) on “Interactive Multimodal Information Management” (IM2, see www.im2.ch) or large European projects such as “Augmented Multi-party Interaction” (AMI, see www.amiproject.org).

Initiated in 1991, and supported by the Swiss Federal Government, the State of Valais, and the City of Martigny, IDIAP (www.idiap.ch) is an independent, nonprofit research institute located in Martigny (at the edge of the Swiss Alps), and is affiliated with EPFL (Ecole Polytechnique Federale de Lausanne) and the University of Geneva. See http://www.idiap.ch/pages/press/faq.pdf for more information and FAQs.

IDIAP is building a culturally diverse research community and strongly encourages applications from female and minority candidates. Prospective candidates should apply with a cover letter, CV, statement of research interests and accomplishments, and names and email addresses of 3 references. Please send these to jobs@idiap.ch, with a clear reference in subject header to “senior position in speech processing”). More information can also be obtained by contacting Prof. Hervé Bourlard (bourlard@idiap.ch), Director of IDIAP.

Start dates are flexible, but applications received by April 30, 2006, will receive full consideration.

Positions Available:
Adaptive Multimodal Interface Research Lab at University of Trento (Italy)

The Adaptive Multimodal Interface Research Lab at University of Trento (Italy) has openings in the following areas:

The Adaptive Multimodal Interface research lab pursues excellence research in next-generation interfaces for human-machine and human-human communication. The research positions will be funded by the prestigious Marie Curie Excellence grant awarded by the European Commission for cutting edge and interdisciplinary research. The candidates for PhD research fellowships should have background in speech, natural language processing or machine learning. The successful applicants should have EE or CS degree with strong academic records. The students will be part of an interdisciplinary research team working on speech recognition, language understanding, spoken dialog, machine learning and adaptive user interfaces. Deadline for application submission is July 11, 2006 (see http://ict.unitn.it/). The candidates for the junior/senior researcher positions should have a PhD degree either in computer science, cognitive science or related disciplines. They will have an established international research track record in their field of expertise and leadership skills. Deadline for application submission is November 1, 2006.

The applicants should be fluent in English. The Italian language competence is optional and applicants are encouraged to acquire this skill on the job. The applicants should have good programming skills in
most of the following C++/Java/JavaScript/Perl/Python.Salaries are competitive and depending on qualifications. Relocation package might be available depending on eligibility. University of Trento is an equal opportunity employer. Interested applicants should send their CV along with their statement of research interest and three reference letters to:

Prof. Ing. Giuseppe Riccardi
Email: riccardi@dit.unitn.it
http://www.dit.unitn.it/~riccardi

About University of Trento and Information and Communication Technology Department (DIT)
The University of Trento is constantly ranked as premiere Italian graduate university institution (see www.dit.unitn.it).
DIT Department

The STC Newsletter would like to provide announcements of professors, researchers, and developers in the speech and language community taking new positions. If you have moved lately or are in the process of moving to a new position in the near future, send us your new contact information so it can be posted in the next edition.

INTERNATIONAL SCHOOL on Neural Networks " E. R. CAIANIELLO" XI COURSE on The Fundamentals of Verbal and Non-verbal Communication and the Biometrical Issue Arrival date: September 2 – Departure date: September 12, 2006 Vietri sul Mare, Italy Details are available on http://www.iiassvietri.it/school2006/index_2006.htm
. School Aims	The school will provide a broad coverage of the major developments in the area of biometrics and verbal and non verbal features exploited in face-to-face communication.
Lecturers	Guido Aversano, CNRS-LTCI, GET-ENST, France; Gérard Bailly ICP. GRENOBLE, France; Nikolaos Bourbakis, ITRI, Wright State University, USA; Ruth Bahr, University of South Florida, USA; Maja Bratanic, University of Zagreb,, Croatia; Marija Bratanic, University of Zagreb, Croatia; Paola Campadelli, Università di Milano, Italy; Nick Campell, ATR Science Labs, Kyoto, Japan Anna Esposito, Second University of Naples, IIASS, Italy; Marcos Faundez-Zanuy, Escola Universitaria Politecnica de Mataro, Spain; David House, Royal Institute of Technology, Sweden; Eric Keller, Université de Lausanne, Switzerland; Adam Kendon, University of Pennsylvania; Dominic Massaro University of Santa Cruz, USA David McNeill University, Chicago, USA; Catherine Pelachaud, Universite de Paris 8, France; Francesco Piazza, Università Politecnica delle Marche, Italy; Neda Pintaric, University of Zagreb, Croatia; Michelina Savino, University of Bari, Italy; Stefanie Shattuck-Hufnagel, MIT, Research Laboratory of Electronics, USA;
Organizing Institutions	Faculty of Psychology, Second University of Naples, Italy Faculty of Science, Mathematics, and Physics, Second University of Naples, Italy Department of Physics, University of Salerno, Italy International Institute for Advanced Scientific Studies (IIASS), Italy Centre "Ettore Majorana" for Scientific Culture, Erice, Italy
DIRECTORS OF THE COURSE Anna Esposito, Italy ; Maja Bratanic, Croatia;		PERMANENT DIRECTOR Maria Marinaro Italy	International Advisory COMMITTEE Maja Bratanic, Croatia; Anna Esposito, Italy ; Erik Keller, Switzerland; Maria Marinaro Italy
.	Application deadline June 10 2006 NATO SUPPORT for registration fee, accommodation and meals ISCA SUPPORT for young students from any country Sponsored by the International Society of Phonetic Sciences (ISPhS)

Position Available:

RESEARCHER IN MACHINE TRANSLATION SRI INTERNATIONAL SPEECH TECHNOLOGY AND RESEARCH LABORATORY

RESEARCHER IN MACHINE TRANSLATION
SRI INTERNATIONAL SPEECH TECHNOLOGY AND RESEARCH LABORATORY