IEEE SPEECH TECHNICAL COMMITTEE NEWSLETTER

August 24, 2005

INTRODUCTION:

Welcome to the IEEE Signal Processing Society Speech Technical Committee (STC) newsletter. Contributions of events, publications, workshops, and career information to the newsletter are welcome. Please send them to Rick Rose (rose@ece.mcgill.ca). Archives of recent STC Newsletters can be found on the STC website.

STC NEWS:
New Workshop: IEEE/ACL 2006 Workshop on Spoken Language Technology
Updated EDICS for Speech Processing and Spoken Language Processing in IEEE Publications and Conferences

SPECIAL ISSUES OF TRANSACTIONS:
Special Issue of the IEEE Transactions on SAP: Progress in Rich Transcription

NEW WORKSHOP ANNOUNCEMENTS:
IEEE SPS 2006 International Workshop on Multimedia Signal Processing
The 2005 Biometric Consortium Conference
WNSP05: Workshop on Non-Linear Speech Processing
ASRU2005: IEEE SPS 2005 Workshop on Automatic Speech Recognition and Understanding
HLT/EMNLP 2005: Human Language Technology Conference / Conference on Empirical Methods in Natural Language Processing
InterSpeech 2005

CAREERS:
Speech Research Openings at ICSI
Speech Research Openings at ScanSoft
Transitions: ASR Researchers Take New Positions

LINKS TO WORKSHOPS AND CONFERENCES:
Links to conferences and workshops organized by date (Rick Rose)

IEEE/ACL 2006 Workshop on Spoken Language Technology
Aruba Marriott Resort & Stellaris Casino

Aruba, Caribbean

December 10 – December 13, 2006

Organizing Committee

General Chair:

Mazin Gilbert, AT&T, USA

Co-Chair:

Hermann Ney, RWTH Aachen, Germany

Finance Chair:

Gokhan Tur, AT&T, USA

Publication Chair:

Brian Roark, OGI, USA

Publicity Chair:

Eric Fosler-Lussier, Ohio State, USA

Industrial Chair:

Roberto Pieraccini, Tell-Eureka, USA

The first workshop on Spoken Language Technology (SLT) sponsored by IEEE and ACL will be held on December 10-December 13, 2006. The goal of this workshop is to bring the speech processing and the natural language processing communities together to share and present recent advances in the area of spoken language technology, and to discuss and foster new research in this area. Spoken language technology is a vibrant research area, with the potential for significant impact on government and industrial applications.

Topics

Spoken language understanding; Spoken document summarization; Machine translation for speech; Spoken dialog systems; Spoken language generation; Spoken document retrieval; Human/Computer Interactions (HCI); Speech data mining; Information extraction from speech; Question/Answering from speech; Multimodal processing; Spoken language systems, applications and standards.

Paper Submission

The workshop program will consist of tutorials, oral and poster presentations, and panel discussions. Attendance will be limited, with priority for those who have technical papers. The style guide, templates, and submission form will follow the IEEE ICASSP style. Three members of the Scientific Committee will review each paper. At least one author of each accepted paper is required to register. The workshop proceedings will be published on a CD-ROM.

Schedule

Camera-ready paper submission deadline	July 15, 2006
Hotel Reservation and Workshop Registration opens	July 30, 2006
Paper Acceptance / Rejection	September 1, 2006
Hotel Reservation and Workshop Registration closes	October 15, 2006
Workshop	December 10-13, 2006

Registration and Information

Registration will be handled via the SLT website.

Updated IEEE SPS EDICS for Speech Processing and Spoken Language Processing

The IEEE SPS Speech Technical Committee is currently in the process of updating the EDICS. EDICS stands for "EDitors Information Classification Scheme". Journals and conferences generally use an EDICS so that authors can classify their submitted papers according to its categories and editors can assign submitted papers to qualified reviewers. It is important that we as a community develop a good set of EDICS because it helps both speed reviewer assignment and ensure reviewer quality for papers submitted to both conferences (ICASSP, ASRU) and journals (IEEE Transactions on Speech and Audio Processing, T-SAP).

There were two major issues that motivated the effort to update the EDICS for speech processing and spoken language processing. First, it was necessary to expand the EDICS in the spoken language area to cover recent advances and trends. Second, there was a desire to integrate the T-SAP EDICS with the ICASSP EDICS so that there is only one set of EDICS for all transactions, conferences, and workshops.

The existing EDICS for the IEEE Transactions on SAP can be found on their web site:

http://www.ewh.ieee.org/soc/sps/tap/edics.html

The existing EDICS for speech and language processing for ICASSP2006 in Toulouse can be found under Section 3.0 at

http://www.icassp2006.org/Papers/PaperTopics.asp

It is intended that the updated EDICS be used for all IEEE SPS speech processing and spoken language processing related publications, conferences, and workshops. The current version of the updated EDICS is based on input from many different people including the SAP Transactions Chief Editor, STC Chairs, and STC members. A draft version of the new and updated EDICS can be found at:

http://ssli.ee.washington.edu/~bilmes/TSAP-ICASSP-EDICS.txt

This draft version will be kept up to date as additional modifications and suggestions come in. While this EDICS has not yet been officially adopted, the SPS Board of Governors and Publications Board will be adopting the updated EDICS in the next few weeks. A good set of EDICS benefits every member of the speech community. Therefore, we strongly encourage everyone to visit the new EDICS page and offer suggestions.


Call for Papers

Special Issue of
The IEEE Transactions on Speech and Audio Processing
Progress in Rich Transcription

Over the past several years, Rich Transcription has emerged as an interdisciplinary field combining automatic speech recognition, speaker identification, and natural language processing with the goal of producing richly annotated speech transcriptions that are useful both to human readers and to automated programs for indexing, retrieval and analysis. The key problems include making more accurate speech transcription technology; improving speaker recognition technology; developing fundamentally new techniques for annotating dialog with semantic intent; and enriching ASR output to present it in a maximally informative manner. These various goals interact with each other, and exploiting synergistic uses of the disparate forms of analysis is critical. With its focus on fundamental research in human communication, Rich Transcription is key to governmental applications in data mining, and to commercial applications such as call center automation and monitoring.

The purpose of this special issue is to present recent advances in all areas of Rich Transcription for Speech, Audio, and Spoken Language Dialog. Original, previously unpublished submissions for the following areas are encouraged:

Speech Recognition algorithms and methods
Natural Language Processing for Rich Transcription
Speaker Recognition algorithms and methods
Algorithms for exploiting large amounts of training data
Novel Approaches to feature extraction and ASR
Unsupervised and semi-supervised training
Paradigms for Data Collection
Usability and human factors studies
Performance analysis and evaluation
Applications in marketing, business & security
Tools and solutions for rich transcription
Incorporation of prosodic and supralexical information

Submission procedure:

Prospective authors should prepare manuscripts according to the Information for Authors as published in any recent issue of the Transactions and as available on the web at http://www.ieee.org/organizations/society/sp/infotsa.html. Note that all rules will apply with regard to submission lengths, mandatory overlength page charges, and color charges.

Manuscripts should be submitted electronically through the online IEEE manuscript submission system at http://sps-ieee.manuscriptcentral.com/. When selecting a manuscript type, authors must click on "Special Issue of T-SA on Progress in Rich Transcription." Authors should follow the instructions for the IEEE Transactions on Speech and Audio Processing and indicate in the Comments to the Editor-in-Chief that the manuscript is submitted for publication in the Special Issue on Progress in Rich Transcription. We require a completed copyright form to be signed and faxed to 1-732-562-8905 at the time of submission. Please indicate the manuscript number on the top of the page.

Schedule:

Submission deadline: 1 October 2005

Notification of acceptance: 1 April 2006

Final manuscript due: 31 May 2006

Tentative publication date: September 2006

Guest Editors:

Dr. Geoffrey Zweig	IBM, Yorktown Heights, NY.	gzweig@us.ibm.com
Dr. John Makhoul	BBN Technologies, Cambridge MA.	makhoul@bbn.com
Dr. Barbara Peskin	ICSI, Berkeley, CA.	barbara@icsi.berkeley.edu
Dr. Phil Woodland	Cambridge University, Cambridge, U.K.	pcw@eng.cam.ac.uk
Dr. Andreas Stolcke	SRI International, Menlo Park, CA.	stolcke@speech.sri.com

General Chair

Li DENG, Microsoft Research, USA, deng@microsoft.com

Technical Chairs

Zicheng LIU, Microsoft Research, USA, zliu@microsoft.com

Adriana DUMITRAS, Apple Computer, USA, adrianad@ieee.org

Zhengyou ZHANG, Microsoft Research, USA, zhang@microsoft.com

Special Sessions Chair

Shahram SHIRANI, McMaster Univ., Canada, shirani@ece.mcmaster.ca

Finance Chair

Belle TSENG, NEC Labs America, USA, belle@sv.nec-labs.com

Publicity Chair

Jane WANG, Univ. British Columbia, Canada, zjanew@ece.ubc.ca

Registration Chair

George TZANETAKIS, Univ. Victoria, Canada, gtzan@cs.uvic.ca

Electronic Media Chair

Zhen WEN, IBM Research, USA, zhenwen@us.ibm.com

Publications Chairs

George TZANETAKIS, Univ. Victoria, Canada, gtzan@cs.uvic.ca

Haitao ZHENG, Univ. California Santa Barbara, USA, htzheng@ieee.org

Local Arrangements Chairs

Ali SHOJA, Univ. Victoria, Canada, gshoja@cs.uvic.ca

Dong YU, Microsoft Research, USA, dongyu@microsoft.com

Asia Liaison

Sadaoki FURUI, Tokyo Inst. of Tech., Japan, furui@cs.titech.ac.jp

Europe Liaison

Pascal FROSSARD, EPFL, Switzerland, pascal.frossard@epfl.ch

IEEE Signal Processing Society

2006 International Workshop on Multimedia Signal Processing

October 3-6, 2006

Fairmont Empress Hotel

Victoria, BC, Canada

MMSP-06 Call for Papers

MMSP-06 is the eighth international workshop on multimedia signal processing organized by the Multimedia Signal Processing Technical Committee of the IEEE Signal Processing Society. The MMSP-06 workshop features several new components that include:

· A Student Paper Contest with awards sponsored by Microsoft Research. To enter the contest a paper submission must have a student as the first author

· Overview sessions that consist of papers presenting the state-of-the-art in methods and applications for selected topics of interest in multimedia signal processing

· Wrap-up presentations that summarize the main contributions of the papers accepted at the workshop, hot topics and current trends in multimedia signal processing

· New content requirements for the submitted papers

· New review guidelines for the submitted papers

SCOPE

Papers are solicited for, but not limited to, the general areas:

· Multimedia Processing (modalities: audio, speech, visual, graphics, other; processing: pre- and post- processing of multimodal data, joint audio/visual and multimodal processing, joint source/channel coding, 2-D and 3-D graphics/geometry coding and animation, multimedia streaming)

· Multimedia Databases (content analysis, representation, indexing, recognition, and retrieval)

· Multimedia Security (data hiding, authentication, and access control)

· Multimedia Networking (priority-based QoS control and scheduling, traffic engineering, soft IP multicast support, home networking technologies, wireless technologies)

· Multimedia Systems Design, Implementation and Applications (design: distributed multimedia systems, real-time and non real-time systems; implementation: multimedia hardware and software; applications: entertainment and games, IP video/web conferencing, wireless web, wireless video phone, distance learning over the Internet, telemedicine over the Internet, distributed virtual reality)

· Human-Machine Interfaces and Interaction using multiple modalities

· Human Perception (including integration of art and technology)

· Standards

SCHEDULE

· Special Sessions (contact the respective chair by): March 8, 2006 (Call for Special Sessions)

· Papers (full paper, 4-6 pages, to be received by): April 8, 2006 (Instructions for Authors)

· Notification of acceptance by: June 8, 2006

· Camera-ready paper submission by: July 8, 2006 (Instructions for Authors)

Check the workshop website http://research.microsoft.com/workshops/MMSP06 for updates.

THE BIOMETRIC CONSORTIUM 2005 CONFERENCE

September 19-21, 2005
Hyatt Regency Crystal City
Arlington, VA, U.S.A.
(703) 418 1234
Conference Website: http://www.nist.gov/bc2005

Registration fee: $540.00

The program includes two and a half days of presentations, seminars and panel discussions with the participation of internationally recognized experts in biometric technologies, system and application developers and users. Over 100 speakers from government, industry, and academia will participate. The multi-track conference offers technology seminars and over 70 biometric technology exhibits (open during the entire conference). The program includes the Tutorial “Introduction to Biometrics” and the Research Symposium (a special session on research offered by the Biometric Knowledge Center of the Center for Identification Technology Research (CITeR)).

The Biometric Consortium conference will address the latest trends in biometrics research, development, testing and evaluation, as well as applications of biometric technology, the current status of biometric standards, and the adoption of these standards. It will also address the important role that biometrics can play in the identification and verification of individuals in this age of heightened security and privacy by examining biometric-based solutions for homeland security as well as the utilization of biometrics in other government and commercial applications.

Scheduled sessions include major government efforts such as the Executive Office of the President of the United States, National Science & Technology Council, Subcommittee on Biometrics, and government programs within the Department of Defense, Department of Homeland Security, Department of Justice, and National Institute of Standards and Technology, as well as Biometric Solutions & Applications, Large-Scale Biometric Implementations, Advances in Biometric Technologies, Security of Biometrics, Biometric Standards, Special Sessions on Biometrics and Physical Access Control and Privacy Policy and Cross Jurisdictional Issues of the Use of Biometrics and Biometric Testing. Attendees will also hear from the International Biometrics Industry Association (IBIA) and the National Biometric Security Project.

Sponsors:
The following organizations are proud to co-sponsor this event:
Information Technology Laboratory (ITL) of the National Institute of Standards and Technology (NIST)
National Security Agency (NSA)
DoD Biometrics Management Office (BMO)
National Biometric Security Project (NBSP)
Department of Homeland Security (DHS)
General Services Administration's Office of Governmentwide Policy
National Institute of Justice (NIJ)
West Virginia Development Office (WVDO).

Supporting Organizations:
Executive Office of the President of the United States, National Science & Technology Council, Subcommittee on Biometrics
American National Standards Institute (ANSI)
BioAPI Consortium
Center for Identification Technology Research (CITeR)
International Biometrics Industry Association (IBIA)
InterNational Committee for Information Technology Standards (INCITS)
The Biometric Foundation

Location:
The Conference will be held at the Hyatt Regency Crystal City, Arlington, VA, USA. More information about the hotel can be found at:
http://www.hyatt.com/usa/arlington/hotels/hotel_wasrc.html

Conference Registration:
The registration fee ($540.00 per person) includes coffee breaks, lunches (Monday and Tuesday), reception on Monday evening, entrance to exhibit hall and conference materials.

Accommodations:
The Conference registration does not include your hotel reservation. A block of rooms has been reserved at the Hyatt Regency Crystal City, (703) 418-1234. To register for a room, please contact the Hyatt and mention that you are attending "The Biometric Consortium Conference" no later than Sunday August 28, 2005.

For more information or to register for the Conference please check our web site http://www.nist.gov/bc2005

Collocated activities:

The following meetings will be collocated with the conference. Details on these meetings (open to the public) will be posted on the conference web site:

InterNational Committee for Information Technology Standards Technical Committee M1 (Biometrics) Ad Hoc Group on Biometrics and E-Authentication (AHGBEA)

WNSP05: WORKSHOP ON NON-LINEAR SPEECH PROCESSING

Heraklion, Crete, Greece
20-23 Sept 2005

http://www.ics.forth.gr/wnsp05/index.html

WNSP05 is organized by the Management Committee (MC) <http://www.nolisp2005.org/cost/index.php?content=participation.htm#management> of the COST 277 European Research Action <http://www.nolisp2005.org/cost/> as a parallel event to the last meeting of the Action. Members of the MC of COST 277 are invited to present overviews as well as novel ideas on nonlinear speech processing algorithms, methods, data collection and applications.

This is a unique opportunity for researchers outside the Action to submit their work, sharing their view with the members of the Action. This meeting will give engineers and PhD students a complete picture of what the challenges and solutions are and in what directions research in nonlinear speech processing is moving.

The goal of WNSP05 is to provide an overview of research on nonlinear speech processing and to indicate future directions. We plan to provide multipoint video streaming so that people at the connected sites can interact with the participants of the meeting. Best-effort video streaming will also be available, and selected talks will be video recorded for later online access. Selected contributions will be published in a book edited by members of the MC of the Action.

--
Yannis Stylianou, Associate Professor
Department of Computer Science
University of Crete
Leof. Knossou
714 09 Heraklion, Crete, Greece
Ph: +30 2810 393 559
Fax: +30 2810 393 501

2005 IEEE Automatic Speech Recognition and Understanding Workshop
November 27 - December 1, 2005
Fiesta Americana Grand Coral Beach Resort
Cancun, Mexico
www.asru2005.org

REMINDER: CALL FOR DEMONSTRATIONS

The ninth biennial IEEE workshop on Automatic Speech Recognition and Understanding (ASRU) will be held November 27-December 1, 2005. The venue will be the Fiesta Americana Grand Coral Beach Resort in Cancun, Mexico. The ASRU workshops have a tradition of bringing together researchers from academia and industry in an intimate and collegial setting to discuss problems of common interest in automatic speech recognition and understanding. Prospective demonstrators are invited to submit 1-2 page proposals via the workshop website for demonstrating leading-edge technology within the broad scope of the workshop. Proposals in all areas of human language technology are encouraged, with emphasis on:

automatic speech recognition and understanding technology
speech to text systems
spoken dialog systems
multilingual language processing
robustness in ASR
spoken document retrieval
speech-to-speech translation

All demo proposals will be handled and reviewed electronically. Please identify any special audio-visual needs in your proposal.

IMPORTANT DATES

August 29, 2005	Demo submission deadline
September 2, 2005	Demo decision notices emailed
September 15, 2005	Author registration deadline
October 1, 2005	Workshop early registration deadline
October 27, 2005	Hotel reservation deadline
November 27 - December 1, 2005	Workshop

HLT/EMNLP 2005:
Registration and Student Volunteer Recruitment

Human Language Technology Conference/
Conference on Empirical Methods in Natural Language Processing

Vancouver, B.C., Canada

Conference date: October 6-8, 2005
Early registration deadline: Sept 1, 2005
Student volunteer application deadline: Aug 26, 2005
URL: http://www.hlt-emnlp05.org

In 2005, HLT (Human Language Technology Conference) and EMNLP (Conference on Empirical Methods in Natural Language Processing) will be a joint conference held in Vancouver, B.C.

HLT/EMNLP 2005 continues the conference series jointly sponsored by the Human Language Technology Advisory Board (HLT) and the Association for Computational Linguistics (ACL). This year's conference is co-sponsored by SIGDAT, the ACL's special interest group on linguistic data and corpus-based approaches to NLP, which has traditionally sponsored the Empirical Methods in Natural Language Processing (EMNLP) Conferences.

The program will include 98 research talks, 30 research posters, and 20 system demonstrations covering a broad selection of areas within human language technology; see the website for a detailed list of papers and demonstrations. There will also be invited talks by Larry Hunter, University of Colorado, on NLP for bioinformatics and Sanjeev Khudanpur, Johns Hopkins University, on NLP and speech systems. Registration for the conference is now available on-line at the website: http://www.hlt-emnlp05.org

For payment, there is a secure connection to Plug N Pay or you may also download the form, complete it and fax or mail it to Priscilla Rasmussen with your payment:

Priscilla Rasmussen
HLT/EMNLP
3 Landmark Center
East Stroudsburg, PA 18301
Fax: +1-570-476-8006
Tel: +1-570-476-8006

HLT-EMNLP 2005: Call for Student Volunteers

HLT-EMNLP 2005 is looking for student volunteers. In exchange for one full day's work, student volunteers receive free registration to the main conference. The work will be divided, probably into two half-day shifts, and the shifts will be scheduled to maximize volunteer access to the conference events. We may be able to provide other amenities, and aim to provide a good work environment.

Tasks will include assisting at the registration desk, stuffing delegate packs, and providing technical assistance for conference events. Preferences for attending specific conference events will be taken into account when possible in allocating tasks. Volunteers are needed from the morning of October 5th through October 10th, 2005. (We do not require all volunteers to arrive by October 5th or stay until October 10th. However, if you can come to Vancouver for one day only, we may not be able to assign you to a volunteer task.)

The Student Volunteer Program is open to full-time students only. There is no application form. Apply by sending email to the following address:
"Anoop Sarkar" <anoop@cs.sfu.ca>

**Important**: put "HLT-EMNLP-05 Student Volunteer Application" in the subject header of your message.

In your application, include the dates you can be in Vancouver, whether you are presenting a paper or poster, what academic program you are enrolled in (which institution and whether UGrad/MSc/PhD), and what year of your program you are in. Include any other information that you think may be important.

The application deadline is Friday, August 26, 2005. We aim to complete the entire selection process before August 30. Notification will be sent by email as we receive applications. Please do *not* register online as a student volunteer until you have received a notification email. The early registration deadline is September 1, 2005, so you will have time to get the early registration rate even if you do not receive a volunteership.

Student Volunteer Coordinators for HLT-EMNLP 2005:

Anoop Sarkar <anoop@cs.sfu.ca>
Fred Popowich <popowich@cs.sfu.ca>

Interspeech 2005 / Eurospeech
September 4-8, 2005
Lisbon, Portugal
www.interspeech2005.org

CALL FOR PARTICIPATION

ISCA, together with the Interspeech 2005 - Eurospeech organizing committee, would like to encourage participation in the upcoming conference.

Interspeech 2005 - Eurospeech is the 9th biennial conference of the International Speech Communication Association (ISCA) and the sixth in the annual series of Interspeech events, after Beijing, Aalborg, Denver, Geneva and Jeju.

This year's theme is "Ubiquitous Speech Processing", a long-term goal for this new millennium, which is also the topic of the panel of our 4th and last plenary session. The conference follows the typical format of Interspeech events, with parallel oral and poster sessions, and a first Tutorial Day program featuring six in-depth tutorials from world-class experts in their field.

The conference will be held September 4-8 in Lisbon, Portugal, at Centro Cultural de Belem, located in the city's most renowned historic area, next to the Jeronimos Monastery and facing the river Tagus. Perched on the edge of the Atlantic Ocean, Lisbon is a city like no other. The city center is a buzzing grid of grand streets with all the trappings of any other Western European city, but also rumbling old trams, antiquated funiculars and elevators that help negotiate Lisbon's steep hills.

The Technical Program will include parallel oral and poster sessions, totaling 781 papers, selected from 1261 regular full paper submissions. Additionally, it will include 10 Special Sessions on new, emerging areas of interest to the spoken language processing community.

KEYNOTE SPEAKERS:
- Monday 5th: ISCA Medalist
- Tuesday 6th: Fernando Pereira, Univ. Pennsylvania, USA
  Title: Linear Models for Structure Prediction
- Wednesday 7th: Elizabeth Shriberg, SRI and ICSI, USA
  Title: Spontaneous Speech: How People Really Talk, and Why Engineers Should Care

TUTORIALS:
Tutorials for Interspeech'2005 will be held on Sunday, September 4th, organized by internationally recognized experts in their fields. The idea behind the tutorials is to provide an overview of each topic as well as to bring to light recent developments in fields concerning spoken language processing.

- Languages for Developing Multimodal Applications
  James A. Larson
- VoIP Technologies and Speech Applications
  Gerald Karam and Giuseppe Di Fabbrizio
- Usability Theory and Practice in Speech Interaction System Design
  Fang Chen and Kristiina Jokinen
- Visions, Technology and Business of Conversational Machines
  Mazin Gilbert and Roberto Pieraccini
- Forensic Speech Science
  Anders Eriksson and Andrzej Drygajlo
- Building Synthetic Voice
  Alan Black

OTHER EVENTS:
- Monday 5th: Opening Ceremony
  Welcome Reception (Cloisters)

- Tuesday 6th: ISCA SIG Meeting
  Student Reception
  Scientific Review Committee Reception (Castle)

- Wednesday 7th: Panel - Towards a SIG on Iberian Languages
    Chair: Nestor Yoma
  Panel - History of Speech Technology
    Chairs: Janet Baker & Patri J. Pugliese
    Panelists: Gunnar Fant, Jim Flanagan, Hiroya Fujisaki, Joseph Mariani, Louis Pols
  ISCA General Assembly
  Banquet

- Thursday 8th: Panel - Ubiquitous Speech Processing
    Chair: Roger K. Moore
    Panelists: Alex Acero, Jordan Cohen, Paul Dalsgaard and Sadaoki Furui
  Closing Session / History of Fado

ACCOMMODATION:
A large number of rooms has been reserved in several hotels. Hotel booking can be done through the travel agency Viagens Abreu, by completing the Hotel Reservation Form (see website) and returning with payment, no later than June 30th.

For further information: www.interspeech2005.org or send email to info@interspeech2005.org

Organizer:
L2F - Spoken Language Systems Laboratory, INESC ID Lisboa
Rua Alves Redol, 9 - 1000-029 Lisbon - Portugal
Phone: +351 213100268 Fax: +351 213145843 www.l2f.inesc-id.pt

SPEECH RESEARCH OPENINGS AT ICSI

The International Computer Science Institute (ICSI) invites applications for positions in speech processing. Interested parties with a range of experience (e.g., both recent PhDs and those with more extensive experience) are encouraged to apply.

ICSI is an independent not-for-profit Institute located a few blocks from the Berkeley campus of the University of California. It is closely affiliated with the University, and particularly with the Electrical Engineering and Computer Science (EECS) Department. See http://www.icsi.berkeley.edu to learn more about ICSI.

The ICSI Speech Group (including its predecessor, the ICSI Realization Group) has been a source of novel approaches to speech processing since 1988. It is primarily known for its work in speech recognition, although it has housed major projects in speaker recognition, metadata extraction, and speech coding in the last few years.

Applications should include a cover letter, vita, and the names of at least 3 references (with both postal and email addresses). Applications should be sent by email to morgan@icsi.berkeley.edu and by postal mail to

Director (re Speech Search)
ICSI
1947 Center Street
Berkeley, CA 94704

ICSI is an Affirmative Action/Equal Opportunity Employer. Applications from women and minorities are especially encouraged. Hiring is contingent on eligibility to work in the United States.

SPEECH RESEARCH OPENINGS AT SCANSOFT

Job Description: Senior language modeling scientist

Responsibilities: The successful candidate will play a key role in developing and improving techniques for language modeling and text processing as part of our transcription software. The result will be more accurate, robust, and functional software for a variety of transcription software projects. You will join a small and talented group of language modeling scientists in our Burlington, MA offices.

Qualifications: You should have a good degree in speech recognition, computational linguistics, or a related field, and preferably a Ph.D. A background in speech, mathematics, computer science, and computational linguistics is highly desirable. You should have strong communication skills and be able to work collaboratively with others on the team and elsewhere within the company. Strong software skills are required, especially Unix/Linux, C/C++, and a scripting language such as Perl or Python. Explicit experience with language modeling for transcription is a plus.

We offer a stimulating research environment with substantial data and computing resources. A competitive salary and benefit package will be offered. Please send your resume to Jeff Adams at jeff.adams@scansoft.com.

Job Description: Senior research scientist in speech recognition

Responsibilities: The candidate will work on improving the recognition performance of our speech recognition engine and its main application in ScanSoft’s award-winning dictation products. Dragon NaturallySpeaking® is our market-leading desktop dictation product. The recently released version 8 showed substantial accuracy improvements over previous versions. DragonMT is our new medical transcription server, which brings the benefit of ScanSoft’s dictation technology to the problem of machine-assisted medical transcription. We are looking for an individual who wants to solve difficult speech recognition problems, and help get those solutions into our products, so that our customers can work more effectively.

Main responsibilities of the candidate will include:

provide experimental and theoretical analysis of speech recognition problems
formulate new algorithms, create research tools, design and carry out experiments to verify new algorithms
work with other members in the team to improve the performance of our products and add new product features to meet business requirements
work with other team members to deliver acoustic models for products
work with development engineers to ensure a high quality implementation of algorithms and models in company products
follow developments in speech recognition to keep our research work state-of-the-art
patent new algorithms and write scientific papers when appropriate

Qualifications:

Ph.D. or Master's degree in computer science or electrical engineering
good analytical and diagnostic skills
experience with C/C++ and with scripting in Perl, Python, and csh in a UNIX environment
ability to work with a large existing code base
desire and ability to be a team player
strong desire and demonstrated ability to work on and solve engineering problems

Preference will be given to candidates who have a strong speech recognition background. Previous involvement in the DARPA EARS project is a plus. New graduates with a good GPA from top universities are encouraged to apply.

Job Description: Senior Data Engineer

Responsibilities: The successful candidate will play a key role in supporting the vast amounts of data used in designing, training, and testing our speech transcription software. The result will be more accurate, robust, and functional software for a variety of transcription software projects. You will join a small and talented group in our Burlington, MA offices. You will help us collect, organize, and prepare speech & text data in support of our research into speech recognition technology. You should have strong computer skills, including Unix, Perl (or a similar scripting language), and C/C++, and a familiarity with text & audio encoding formats. A background in speech & language processing or research is highly desirable, as is experience working with very large corpora (speech and/or text).
Qualifications: You will be self-motivated, able to plan the details of accomplishing a given task, and able to anticipate and prepare for future needs. This is a senior position, and the successful candidate would be expected to familiarize themselves with our needs and processes, and to take initiative to find and fill gaps in our data, or fix weaknesses in our processes. This employee should be "lazy" enough to find automated, permanent solutions to problems, rather than temporary fixes.
We offer a stimulating research environment with substantial data and computing resources. A competitive salary and benefit package will be offered.

Please send your resume to Jeff Adams at jeff.adams@scansoft.com .

ASR Researchers Take New Positions

The STC Newsletter would like to provide announcements of professors, researchers, and developers in the speech area taking new positions. If you have moved lately or are in the process of moving to a new position in the near future, send your new contact information to the STC Newsletter so it can be posted in the next edition.

In August 2005, Michiel Bacchiani moved from the IBM T.J. Watson Research Center in Yorktown Heights, NY to Google in New York, NY.

Google Inc.
1440 Broadway,
New York, NY 10018
michiel at google dot com

On July 27, 2005, Roberto Pieraccini took the position of CTO at Tell-Eureka (www.telleureka.com), a Manhattan-based company specializing in spoken dialog systems for selected vertical markets. He has left his previous position at the IBM T.J. Watson Research Center in Yorktown Heights, NY.

robertopieraccini at yahoo dot com

Links to Upcoming Conferences and Workshops

(Organized by Date)

Auditory-Visual Speech Processing (AVSP 2005)
Vancouver Island, British Columbia, Canada, July 24-27, 2005
http://marcs.uws.edu.au/links/avisa/avsp05

SIGdial Workshop on Discourse and Dialog
Lisbon, Portugal, September 2-3, 2005
http://www.sigdial.org/workshops/workshop6

ITRW 2005 Workshop on DSP for In-Vehicle and Mobile Systems
Sesimbra, Portugal, September 3, 2005
http://dspincars.sdsu.edu/

EUROSPEECH 2005 9th European Conference on Speech Communication and Technology
Lisbon, Portugal, September 4-8, 2005
http://www.interspeech2005.org/

Disfluency in Spontaneous Speech
Aix-en-Provence, September 10-12, 2005
http://www.up.univ-mrs.fr/delic/Diss05

WNSP-2005 Workshop on Non-Linear Speech Processing
Heraklion, Crete, Greece, September 20-23, 2005
http://www.ics.forth.gr/wnsp05/index.html

HLT/EMNLP 2005
Vancouver, B.C., Canada, October 6-8, 2005
http://www.hlt-emnlp05.org

IEEE WASPAA2005 Workshop on Applications of Signal Processing to Audio and Acoustics
New Paltz, New York, October 16-19, 2005
http://www.LNT.de/~WASPAA05/

SPECOM 2005 - 10th International Conf. on Speech and Computers
Patras, Greece, October 17-19, 2005
http://www.wcl.ee.upatras.gr/specom2005.htm

IEEE ASRU2005 Automatic Speech Recognition and Understanding Workshop
Cancun, Mexico, November 27 - December 1, 2005
http://www.asru2005.org

ICASSP2006
Toulouse, France, May 15-19, 2006
http://www.icassp2006.org

INTERSPEECH 2006 - ICSLP
Pittsburgh, PA, USA, September 17-21, 2006
http://www.interspeech2006.org/

MMSP-2006
Victoria, BC, Canada, October 3-6, 2006
http://research.microsoft.com/workshops/MMSP06

IEEE/ACL Workshop on SLT
Aruba, December 10-13, 2006

ICASSP2007
Honolulu, Hawaii, USA, April 17-20, 2007

INTERSPEECH 2007
Antwerp, Belgium, August 27-31, 2007
http://www.interspeech2007.org/

Submission deadline:	1 October 2005
Notification of acceptance:	1 April 2006
Final manuscript due:	31 May 2006
Tentative publication date:	September 2006