IEEE
SPEECH TECHNICAL COMMITTEE NEWSLETTER
August 24, 2005
INTRODUCTION:
Welcome to the IEEE Signal Processing Society Speech Technical Committee (STC)
newsletter. Contributions of events, publications, workshops, and career
information to the newsletter are welcome. Please send them to Rick Rose
(rose@ece.mcgill.ca). Archives of recent STC Newsletters can be found on the
STC website.
STC NEWS:
New Workshop: IEEE/ACL 2006 Workshop on Spoken Language Technology
Updated EDICS for Speech Processing and Spoken Language Processing in IEEE Publications and Conferences
SPECIAL ISSUES OF TRANSACTIONS:
Special Issue of the IEEE Transactions on SAP: Progress in Rich Transcription
NEW WORKSHOP ANNOUNCEMENTS:
IEEE SPS 2006 International Workshop on Multimedia Signal Processing
The 2005 Biometric Consortium Conference
WNSP05: Workshop on Non-Linear Speech Processing
ASRU2005: IEEE SPS 2005 Workshop on Automatic Speech Recognition and Understanding
HLT/EMNLP 2005: Human Language Technology Conference / Conference on Empirical Methods in Natural Language Processing
InterSpeech 2005
CAREERS:
Speech Research Openings at ICSI
Speech Research Openings at ScanSoft
Transitions: ASR Researchers Take New Positions
LINKS TO WORKSHOPS AND CONFERENCES:
Links to conferences and workshops organized by date (Rick Rose)
IEEE/ACL 2006 Workshop on Spoken Language Technology
Aruba Marriott Resort & Stellaris Casino
Aruba, Caribbean
December 10 – December 13, 2006
Organizing Committee
General Chair: Mazin Gilbert, AT&T, USA
Co-Chair: Hermann Ney, RWTH Aachen, Germany
Finance Chair: Gokhan Tur, AT&T, USA
Publication Chair: Brian Roark, OGI, USA
Publicity Chair: Eric Fosler-Lussier, Ohio State, USA
Industrial Chair: Roberto Pieraccini, Tell-Eureka, USA
The first workshop on Spoken Language Technology (SLT) sponsored by IEEE and
ACL will be held on December 10-13, 2006. The goal of this workshop is to bring
the speech processing and the natural language processing communities together
to share and present recent advances in the area of spoken language technology,
and to discuss and foster new research in this area. Spoken language technology
is a vibrant research area, with the potential for significant impact on
government and industrial applications.
Topics
Spoken language understanding; Spoken document summarization; Machine
translation for speech; Spoken dialog systems; Spoken language generation;
Spoken document retrieval; Human/Computer Interactions (HCI); Speech data
mining; Information extraction from speech; Question/Answering from speech;
Multimodal processing; Spoken language systems, applications and standards.
Paper Submission
The workshop program will consist of tutorials, oral and poster presentations,
and panel discussions. Attendance will be limited, with priority for those who
have technical papers. The style guide, templates, and submission form will
follow the IEEE ICASSP style. Three members of the Scientific Committee will
review each paper. At least one author of each accepted paper is required to
register. The workshop proceedings will be published on a CD-ROM.
Schedule
Camera-ready paper submission deadline: July 15, 2006
Hotel Reservation and Workshop Registration opens: July 30, 2006
Paper Acceptance / Rejection: September 1, 2006
Hotel Reservation and Workshop Registration closes: October 15, 2006
Workshop: December 10-13, 2006
Registration and Information
Registration will be handled via the SLT website.
Updated IEEE SPS EDICS for Speech Processing and Spoken Language Processing
The IEEE SPS Speech Technical Committee is currently in the process of updating the EDICS. EDICS stands for "EDitors Information Classification Scheme". An EDICS is in general used by journals and conferences so authors can classify their submitted papers according to the categories defined under the EDICS and editors can assign submitted papers to qualified reviewers. It is important that we as a community develop a good set of EDICS because it helps both speed reviewer assignment and ensure reviewer quality for papers submitted to both conferences (ICASSP, ASRU) and journals (IEEE Transactions on Speech and Audio Processing T-SAP).
There were two major issues that motivated the effort to update the
EDICS for speech processing and spoken language processing. First, it
was necessary to expand the EDICS in the spoken language area to cover
recent advances and trends. Second, there was a desire to integrate the
T-SAP EDICS with the ICASSP EDICS so that there is only one set of
EDICS for all transactions, conferences, and workshops.
The existing EDICS for the IEEE Transactions on SAP can be found
on their web site:
http://www.ewh.ieee.org/soc/sps/tap/edics.html
The existing EDICS for speech and language processing for
ICASSP2006 in Toulouse can be found under Section 3.0 at
http://www.icassp2006.org/Papers/PaperTopics.asp
It is intended that the updated EDICS be used for all IEEE SPS speech
processing and spoken language processing related publications,
conferences, and workshops. The current version of the updated EDICS is
based on input from many different people including the SAP
Transactions Chief Editor, STC Chairs, and STC members. A draft version
of the new and updated EDICS can be found at:
http://ssli.ee.washington.edu/~bilmes/TSAP-ICASSP-EDICS.txt
This draft version will be kept up to date as additional modifications
and suggestions come in. While this EDICS has not yet been officially
adopted, the SPS Board of Governors and Publications Board will be
adopting the updated EDICS in the next few weeks. A good set of EDICS
benefits the whole speech community, so we strongly encourage everyone
to visit the new EDICS page and offer suggestions.
Call for Papers
Special Issue of the IEEE Transactions on Speech and Audio Processing
Progress in Rich Transcription
Over the past several years, Rich Transcription has emerged as an
interdisciplinary field combining automatic speech recognition,
speaker identification, and natural language processing with the goal
of producing richly annotated speech transcriptions that are useful
both to human readers and to automated programs for indexing,
retrieval and analysis. The key problems include making more accurate
speech transcription technology; improving speaker recognition
technology; developing fundamentally new techniques for annotating
dialog with semantic intent; and enriching ASR output to present it in
a maximally informative manner. These various goals interact with each
other, and exploiting synergistic uses of the disparate forms of
analysis is critical. With its focus on fundamental research in human
communication, Rich Transcription is key to governmental applications
in data mining, and to commercial applications such as call center
automation and monitoring.
The purpose of this special issue is to present
recent advances in all
areas of Rich Transcription for Speech, Audio, and Spoken Language
Dialog. Original, previously unpublished submissions for the following
areas are encouraged:
- Speech Recognition algorithms and methods
- Natural Language Processing for Rich Transcription
- Speaker Recognition algorithms and methods
- Algorithms for exploiting large amounts of training data
- Novel Approaches to feature extraction and ASR
- Unsupervised and semi-supervised training
- Paradigms for Data Collection
- Usability and human factors studies
- Performance analysis and evaluation
- Applications in marketing, business & security
- Tools and solutions for rich transcription
- Incorporation of prosodic and supralexical information
Submission procedure:
Prospective authors should prepare manuscripts
according to the
Information for Authors as published in any recent issue of the
Transactions and as available on the web at
http://www.ieee.org/organizations/society/sp/infotsa.html. Note
that all rules will apply with regard to submission lengths, mandatory
overlength page charges, and color charges.
Manuscripts should be submitted electronically through the online IEEE
manuscript submission system at http://sps-ieee.manuscriptcentral.com/.
When selecting a manuscript type, authors must click on
"Special Issue of T-SA on Progress in Rich
Transcription." Authors should follow the instructions for the
IEEE Transactions on Speech and Audio Processing and indicate in the
Comments to the Editor-in-Chief that the manuscript is submitted for
publication in the Special Issue on Progress
in Rich Transcription. We require a completed copyright form
to be signed and faxed to 1-732-562-8905 at the time of
submission. Please indicate the manuscript number on the top of the
page.
Schedule:
Submission deadline: 1 October 2005
Notification of acceptance: 1 April 2006
Final manuscript due: 31 May 2006
Tentative publication date: September 2006
Guest Editors:
Dr. Geoffrey Zweig, IBM, Yorktown Heights, NY, gzweig@us.ibm.com
Dr. John Makhoul, BBN Technologies, Cambridge, MA, makhoul@bbn.com
Dr. Barbara Peskin, ICSI, Berkeley, CA, barbara@icsi.berkeley.edu
Dr. Phil Woodland, Cambridge University, Cambridge, U.K., pcw@eng.cam.ac.uk
Dr. Andreas Stolcke, SRI International, Menlo Park, CA, stolcke@speech.sri.com
General Chair
Li DENG, Microsoft Research, USA, deng@microsoft.com
Technical Chairs
Zicheng LIU, Microsoft Research, USA, zliu@microsoft.com
Adriana DUMITRAS, Apple Computer, USA, adrianad@ieee.org
Zhengyou ZHANG, Microsoft Research, USA, zhang@microsoft.com
Special Sessions Chair
Shahram SHIRANI, McMaster Univ., Canada, shirani@ece.mcmaster.ca
Finance Chair
Belle TSENG, NEC Labs America, USA, belle@sv.nec-labs.com
Publicity Chair
Registration Chair
George TZANETAKIS, Univ. Victoria, Canada, gtzan@cs.uvic.ca
Electronic Media Chair
Zhen WEN, IBM Research, USA, zhenwen@us.ibm.com
Publications Chairs
George TZANETAKIS, Univ. Victoria, Canada, gtzan@cs.uvic.ca
Haitao ZHENG, Univ. California Santa Barbara, USA, htzheng@ieee.org
Local Arrangements Chairs
Ali SHOJA, Univ. Victoria, Canada, gshoja@cs.uvic.ca
Dong YU, Microsoft Research, USA, dongyu@microsoft.com
Asia Liaison
Sadaoki FURUI, Tokyo Inst. of Tech., Japan, furui@cs.titech.ac.jp
Europe Liaison
Pascal FROSSARD, EPFL, Switzerland, pascal.frossard@epfl.ch
IEEE Signal Processing Society
2006 International Workshop on Multimedia Signal Processing
October 3-6, 2006
Fairmont Empress Hotel
Victoria, BC, Canada
MMSP-06 Call for Papers
MMSP-06 is the
eighth international workshop on multimedia signal processing organized
by the Multimedia Signal Processing Technical Committee of the IEEE
Signal Processing Society. The MMSP-06 workshop features several
new components that include:
- A Student Paper Contest with awards sponsored by Microsoft Research. To
  enter the contest, a paper submission must have a student as the first author
- Overview sessions that consist of papers presenting the state-of-the-art in
  methods and applications for selected topics of interest in multimedia
  signal processing
- Wrap-up presentations that summarize the main contributions of the papers
  accepted at the workshop, hot topics and current trends in multimedia signal
  processing
- New content requirements for the submitted papers
- New review guidelines for the submitted papers
SCOPE
Papers are solicited for, but not limited
to, the general areas:
- Multimedia Processing (modalities: audio, speech, visual, graphics, other;
  processing: pre- and post-processing of multimodal data, joint audio/visual
  and multimodal processing, joint source/channel coding, 2-D and 3-D
  graphics/geometry coding and animation, multimedia streaming)
- Multimedia Databases (content analysis, representation, indexing,
  recognition, and retrieval)
- Multimedia Security (data hiding, authentication, and access control)
- Multimedia Networking (priority-based QoS control and scheduling, traffic
  engineering, soft IP multicast support, home networking technologies,
  wireless technologies)
- Multimedia Systems Design, Implementation and Applications (design:
  distributed multimedia systems, real-time and non real-time systems;
  implementation: multimedia hardware and software; applications:
  entertainment and games, IP video/web conferencing, wireless web, wireless
  video phone, distance learning over the Internet, telemedicine over the
  Internet, distributed virtual reality)
- Human-Machine Interfaces and Interaction using multiple modalities
- Human Perception (including integration of art and technology)
- Standards
SCHEDULE
- Special Sessions (contact the respective chair by): March 8, 2006 (Call for
  Special Sessions)
- Papers (full paper, 4-6 pages, to be received by): April 8, 2006
  (Instructions for Authors)
- Notification of acceptance by: June 8, 2006
- Camera-ready paper submission by: July 8, 2006 (Instructions for Authors)
Check the workshop
website http://research.microsoft.com/workshops/MMSP06
for updates.
THE BIOMETRIC CONSORTIUM 2005 CONFERENCE
September 19-21, 2005
Hyatt Regency Crystal City
Arlington, VA, U.S.A.
(703) 418 1234
Conference Website:
http://www.nist.gov/bc2005
Registration fee: $540.00
The program includes two and a
half days of presentations, seminars and panel discussions with the
participation of internationally recognized experts in biometric
technologies, system and application developers and users. Over 100
speakers from government, industry, and academia will participate. The
multi-track conference offers technology seminars and over 70 biometric
technology exhibits (open during the entire conference). The program
includes the Tutorial “Introduction to Biometrics” and
the Research Symposium (a special session on research offered by the
Biometric Knowledge Center of the Center for Identification Technology
Research (CITeR)).
The Biometric Consortium conference will address the latest trends in
biometrics research, development, testing and evaluation as well as
applications of biometric technology and the current status of
biometric
standards and the adoption of these standards. It will also address the
important role that biometrics can play in the identification and
verification of individuals in this age of heightened security and
privacy by examining biometric-based solutions for homeland security as
well as the utilization of biometrics in other government and
commercial
applications.
Scheduled sessions include major government efforts such as the
Executive Office of the President of the United States, National Science
& Technology Council, Subcommittee on Biometrics, and government
programs within the Department of Defense, Department of Homeland Security,
Department of Justice, and National Institute of Standards and Technology;
as well as Biometric Solutions & Applications, Large-Scale Biometric
Implementations, Advances in Biometric Technologies, Security of
Biometrics, Biometric Standards, Special Sessions on Biometrics and
Physical Access Control and Privacy Policy and Cross Jurisdictional
Issues of the Use for Biometrics and Biometric Testing. Attendees will
also learn from the International Biometrics Industry Association (IBIA)
and the National Biometric Security Project.
Sponsors:
The following organizations are proud to co-sponsor this event:
Information Technology Laboratory (ITL) of the National Institute of
Standards and Technology (NIST)
National Security Agency (NSA)
DoD Biometrics Management Office (BMO)
National Biometric Security Project (NBSP)
Department of Homeland Security (DHS)
General Services Administration's Office of Governmentwide Policy
National Institute of Justice (NIJ)
West Virginia Development Office (WVDO).
Supporting Organizations:
Executive Office of the President of the United States, National
Science & Technology Council, Subcommittee on Biometrics
American National Standards Institute (ANSI)
BioAPI Consortium
Center for Identification Technology Research (CITeR).
International Biometrics Industry Association (IBIA)
InterNational Committee for Information Technology Standards (INCITS)
The Biometric Foundation
Location:
The Conference will be held at the Hyatt Regency Crystal City,
Arlington, VA, USA. More information about the hotel can be found
at:
http://www.hyatt.com/usa/arlington/hotels/hotel_wasrc.html
Conference Registration:
The registration fee ($540.00 per person) includes coffee breaks, lunches
(Monday and Tuesday), reception on Monday evening, entrance to exhibit hall
and conference materials.
Accommodations:
The Conference registration does not include your hotel
reservation.
A block of rooms has been reserved at the Hyatt Regency Crystal City,
(703) 418-1234. To register for a room, please contact the Hyatt and
mention that you are attending "The Biometric Consortium
Conference" no later than Sunday August 28, 2005.
For more information or to register for the Conference please check
our web site
http://www.nist.gov/bc2005
Collocated activities:
The following meetings will be collocated with the
conference. Details on these meetings (open to the public) will be
posted on the conference web site:
InterNational Committee for Information Technology Standards
Technical Committee M1 (Biometrics) Ad Hoc Group on Biometrics and
E-Authentication (AHGBEA)
WNSP05: WORKSHOP ON NON-LINEAR SPEECH PROCESSING
Heraklion, Crete, Greece
20-23 Sept 2005
WNSP05 is organized by the Management Committee (MC) http://www.nolisp2005.org/cost/index.php?content=participation.htm#management
of COST-277 European Research Action <http://www.nolisp2005.org/cost/>
as a parallel event to the last meeting of the Action. Members of the
MC of COST 277 are invited to present overviews as well as novel ideas
on nonlinear speech processing algorithms, methods, data collection and
applications.
This is a unique opportunity for researchers outside the Action to
submit their work, sharing their view with the members of the Action.
This meeting will give engineers and PhD students a complete picture of
what the challenges and solutions are and in what directions research
in nonlinear speech processing is moving.
The goal of WNSP05 is to provide an overview of research on nonlinear
speech processing and to indicate future directions. We plan to provide
multipoint video streaming so that people at the connected sites will be
able to interact with the participants of the meeting. Best-effort
video streaming will also be available, and selected talks will be video
recorded for later online access. Selected contributions will
be published in a book edited by members of the MC of the Action.
--
Yannis Stylianou, Associate Professor
Department of Computer Science
University of Crete
Leof. Knossou
714 09 Heraklion, Crete, Greece
Ph: +30 2810 393 559
Fax: +30 2810 393 501
2005 IEEE Automatic Speech
Recognition and Understanding Workshop
November 27 - December 1, 2005
Fiesta Americana Grand Coral Beach
Resort
Cancun, Mexico
www.asru2005.org
REMINDER: CALL FOR
DEMONSTRATIONS
The ninth biennial IEEE workshop on Automatic Speech Recognition and
Understanding (ASRU) will be held November 27-December 1, 2005. The
venue will be the Fiesta Americana Grand Coral Beach Resort in Cancun,
Mexico. The ASRU workshops have a tradition of bringing together
researchers from academia and industry in an intimate and collegial
setting to discuss problems of common interest in automatic speech
recognition and understanding. Prospective demonstrators are
invited to submit 1-2 page proposals via
the workshop website for demonstrating leading edge technology within
the broad scope of the workshop. Proposals in all areas of human
language technology are encouraged, with emphasis placed on:
- automatic speech recognition and understanding
technology
- speech to text systems
- spoken dialog systems
- multilingual language processing
- robustness in ASR
- spoken document retrieval
- speech-to-speech translation
All demo proposals will be handled and reviewed electronically.
Please identify any special audio-visual needs in your proposal.
IMPORTANT DATES
Aug. 29, 2005: Demo submission deadline
September 2, 2005: Demo decision notices emailed
September 15, 2005: Author registration deadline
October 1, 2005: Workshop early registration deadline
October 27, 2005: Hotel reservation deadline
Nov. 27 - Dec. 1, 2005: Workshop
HLT/EMNLP 2005: Registration and Student Volunteer Recruitment
Human Language Technology Conference / Conference on Empirical Methods in Natural Language Processing
Vancouver, B.C., Canada
Conference date: October 6-8, 2005
Early registration deadline: Sept 1, 2005
Student volunteer application deadline: Aug 26, 2005
URL: http://www.hlt-emnlp05.org
In 2005, HLT (Human Language Technology Conference) and EMNLP
(Conference on Empirical Methods in Natural Language Processing) will
be a joint
conference held in Vancouver, B.C.
HLT/EMNLP 2005 continues the conference series jointly sponsored by the
Human Language Technology Advisory Board (HLT) and the Association for
Computational Linguistics (ACL). This year's conference is
co-sponsored by SIGDAT, the ACL's special interest group on linguistic
data and corpus-based approaches to NLP, which has traditionally
sponsored the Empirical Methods in Natural Language Processing (EMNLP)
Conferences.
The program will include presentation of 98 research talks, 30 research
posters, and 20 system demonstrations covering a broad selection of
areas within human language technology; see the website for a detailed
list of papers and demonstrations. There will also be invited
talks by Larry Hunter, University of Colorado, on NLP for
bioinformatics and Sanjeev Khudanpur, Johns Hopkins University, on NLP
and speech systems. Registration for the conference is now available
on-line at the website: http://www.hlt-emnlp05.org
For payment, there is a secure connection to Plug N Pay or you may also
download the form, complete it and fax or mail it to Priscilla
Rasmussen with your payment:
Priscilla Rasmussen
HLT/EMNLP
3 Landmark Center
East Stroudsburg, PA 18301
Fax: +1-570-476-8006
Tel: +1-570-476-8006
HLT-EMNLP 2005: Call for Student
Volunteers
HLT-EMNLP 2005 is looking for student volunteers. In exchange for
one full day's work, student volunteers receive free registration to
the main conference. The work will be divided, probably into two
half-day shifts, and the shifts will be scheduled to maximize volunteer
access to the conference events. We may be able to provide other
amenities, and aim to provide a good work environment.
Tasks will include assisting at the registration desk, stuffing
delegate packs, and providing technical assistance for conference
events. Wishes about attendance to specific conference events
will be taken into account when possible in allocating tasks.
Volunteers are needed from the morning of October 5th through October
10th, 2005. (But we do not require all volunteers to arrive by
October 5th or stay until October 10th. However, if you can come to
Vancouver for one day only we may not be able to assign you to a
volunteer task.)
The Student Volunteer Program is open to full-time students only. There
is no application form. Apply by sending email to the following
address:
"Anoop Sarkar" <anoop@cs.sfu.ca>
**Important**: put "HLT-EMNLP-05 Student Volunteer Application" in the
subject header of your message.
In your application, include the dates you can be in Vancouver, whether
you are presenting a paper or poster, what academic program you are
enrolled in (which institution and whether UGrad/MSc/PhD), and what
year of your program you are in. Include any other information
that you think may be important.
The application deadline is Friday, August 26, 2005. We aim to complete
the entire selection process before August 30. Notification will be
done by email as we receive application emails. Please do *not*
register online as a student volunteer until you have received a
notification email. The early registration deadline is September 1,
2005, so you will have time to get the early registration rate even if you
do not receive a volunteership.
Student Volunteer Coordinators for HLT-EMNLP 2005:
Anoop Sarkar <anoop@cs.sfu.ca>
Fred Popowich <popowich@cs.sfu.ca>
ISCA, together with the Interspeech 2005 - Eurospeech organizing
committee, would like to encourage participation in the upcoming
conference.
Interspeech 2005 - Eurospeech is the 9th biennial conference of the
International Speech Communication Association (ISCA) and the sixth in
the annual series of Interspeech events, after Beijing, Aalborg,
Denver, Geneva and Jeju.
This year's theme is "Ubiquitous Speech Processing", a long-term goal
for this new millennium, which is also the topic of the panel of our 4th
and last plenary session. The conference follows the typical format of
Interspeech events, with parallel oral and poster sessions, and a first
Tutorial Day program featuring six in-depth tutorials from world-class
experts in their field.
The conference will be held September 4-8 in Lisbon, Portugal, at
Centro Cultural de Belem, located in the city's most renowned historic
area, next to the Jeronimos Monastery and facing the river Tagus.
Perched on the edge of the Atlantic Ocean, Lisbon is a city like no
other. The city center is a buzzing grid of grand streets with
all the trappings of any other Western European city, but also rumbling
old trams, antiquated funiculars and elevators that help negotiate
Lisbon's steep hills.
The Technical Program will include parallel oral and poster sessions,
totaling 781 papers, selected from 1261 regular full paper submissions.
Additionally, it will include 10 Special Sessions on new, emerging
areas of interest to the spoken language processing community.
KEYNOTE SPEAKERS:
- Monday 5th: ISCA Medalist
- Tuesday 6th: Fernando Pereira, Univ. Pennsylvania, USA
Title: Linear Models for Structure Prediction
- Wednesday 7th: Elizabeth Shriberg, SRI and ICSI, USA
Title: Spontaneous Speech: How People Really Talk, and Why Engineers Should Care
TUTORIALS:
Tutorials for Interspeech'2005 will be held on Sunday, September 4th,
organized by internationally recognized experts in their fields. The
idea behind the tutorials is to provide an overview of the topic
as well as to bring to light the recent developments in fields
concerning spoken language processing.
- Languages for Developing Multimodal Applications
James A. Larson
- VoIP Technologies and Speech Applications
Gerald Karam and Giuseppe Di Fabbrizio
- Usability Theory and Practice in Speech Interaction System Design
Fang Chen and Kristiina Jokinen
- Visions, Technology and Business of Conversational Machines
Mazin Gilbert and Roberto Pieraccini
- Forensic Speech Science
Anders Eriksson and Andrzej Drygajlo
- Building Synthetic Voice
Alan Black
OTHER EVENTS:
- Monday 5th: Opening Ceremony
Welcome Reception (Cloisters)
- Tuesday 6th: ISCA SIG Meeting
Student Reception
Scientific Review Committee Reception (Castle)
- Wednesday 7th: Panel - Towards a SIG on Iberian Languages
Chair: Nestor Yoma
Panel - History of Speech Technology
Chairs: Janet Baker & Patri J. Pugliese
Panelists: Gunnar Fant, Jim Flanagan,
Hiroya Fujisaki, Joseph Mariani, Louis Pols
ISCA General Assembly
Banquet
- Thursday 8th: Panel - Ubiquitous Speech Processing
Chair: Roger K. Moore
Panelists: Alex Acero, Jordan Cohen,
Paul Dalsgaard and Sadaoki Furui
Closing Session / History of Fado
ACCOMMODATION:
A large number of rooms has been reserved in several hotels. Hotel
booking can be done through the travel agency Viagens Abreu, by
completing the Hotel Reservation Form (see website) and returning with
payment, no later than June 30th.
For further information: www.interspeech2005.org or
send email to info@interspeech2005.org
Organizer:
L2F - Spoken Language Systems Laboratory, INESC ID Lisboa
Rua Alves Redol, 9 - 1000-029 Lisbon - Portugal
Phone:+351 213100268 Fax: +351 213145843 www.l2f.inesc-id.pt
SPEECH
RESEARCH OPENINGS AT ICSI
The International Computer Science Institute (ICSI) invites
applications for positions in speech processing. Interested parties
with a range of experience (e.g., both recent PhDs and those with more
extensive experience) are encouraged to apply.
ICSI is an independent not-for-profit Institute located a few blocks
from the Berkeley campus of the University of California. It is closely
affiliated with the University, and particularly with the Electrical
Engineering and Computer Science (EECS) Department. See http://www.icsi.berkeley.edu
to learn more about ICSI.
The ICSI Speech Group (including its predecessor, the ICSI Realization
Group) has been a source of novel approaches to speech processing since
1988. It is primarily known for its work in speech recognition,
although it has housed major projects in speaker recognition, metadata
extraction, and speech coding in the last few years.
Applications should include a cover letter, vita, and the names of at
least 3 references (with both postal and email addresses). Applications
should be sent by email to morgan@icsi.berkeley.edu
and by postal mail
to
Director (re Speech Search)
ICSI
1947 Center Street
Berkeley, CA 94704
ICSI is an Affirmative Action/Equal Opportunity Employer. Applications
from women and minorities are especially encouraged. Hiring is
contingent on eligibility to work in the United States.
SPEECH RESEARCH OPENINGS AT SCANSOFT
Job Description: Senior language modeling scientist
Responsibilities:
The successful candidate will play a key role in developing and improving
techniques for language modeling and text processing as part of our
transcription software. The result will be more accurate, robust, and
functional software for a variety of transcription software projects. You will
join a small and talented group of language modeling scientists in our
Burlington, MA offices.
Qualifications: You should have a good degree in speech recognition,
computational linguistics, or a related field, and preferably a Ph.D. A
background in speech, mathematics, computer science, and computational
linguistics is highly desirable. You should have strong communication skills
and be able to work collaboratively with others on the team and elsewhere
within the company. Strong software skills are required, especially Unix/Linux,
C/C++, and a scripting language such as Perl or Python. Explicit experience
with language modeling for transcription is a plus.
We offer a stimulating research environment with substantial data and
computing resources. A competitive salary and benefit package will be
offered. Please send your resume to Jeff Adams at jeff.adams@scansoft.com.
Job Description: Senior research scientist in speech recognition
Responsibilities:
The candidate will work on improving the recognition performance of our
speech recognition engine and its main application in ScanSoft’s
award-winning dictation products. Dragon NaturallySpeaking® is our
market-leading desktop dictation product. The recently released version 8
showed substantial accuracy improvements over previous versions. DragonMT is
our new medical transcription server, which brings the benefit of ScanSoft’s
dictation technology to the problem of machine-assisted medical transcription.
We are looking for an individual who wants to solve difficult speech
recognition problems, and help get those solutions into our products, so that
our customers can work more effectively.
Main responsibilities of the candidate will include:
- provide experimental and theoretical analysis of speech recognition problems
- formulate new algorithms, create research tools, design and carry out
  experiments to verify new algorithms
- work with other members in the team to improve the performance of our
  products and add new product features to meet business requirements
- work with other team members to deliver acoustic models for products
- work with development engineers to ensure a high quality implementation of
  algorithms and models in company products
- follow developments in speech recognition to keep our research work
  state-of-the-art
- patent new algorithms and write scientific papers when appropriate
Qualifications:
- Ph.D. or Master's degree in computer
science or electrical engineering
- good analytical and diagnostic skills
- experience with C/C++ and scripting using
Perl, Python, and csh in a UNIX environment
- ability to work with a large existing
code base
- desire and ability to be a team player
- strong desire and demonstrated ability
to work on and solve engineering problems
Preference will be given to candidates who have a strong speech
recognition background. Previous involvement in the DARPA EARS project is a
plus.
New graduates with a good GPA from top universities are encouraged to
apply.
Job Description: Senior Data Engineer
Responsibilities: The successful candidate will play a key
role in supporting the
vast amounts of data used in designing, training, and testing our
speech transcription
software. The result will be more accurate, robust, and functional
software for
a variety of transcription software projects. You will join a small and
talented group in our Burlington, MA
offices. You will help us collect, organize, and prepare speech
&
text data in support of our research into speech recognition
technology. You
should have strong computer skills, to include Unix, Perl (or a similar
scripting language), C/C++, and a familiarity with text & audio
encoding
formats. A background in speech & language processing or research
is highly
desirable, as is experience working with very large corpora (speech and/or
text).
Qualifications:
You will be self-motivated, able to plan the details
of
accomplishing a given task, and able to anticipate and prepare for
future
needs. This is a senior position, and the successful candidate would be
expected to familiarize themselves with our needs and processes, and to
take
initiative to find and fill gaps in our data, or fix weaknesses in our
processes. This employee should be "lazy" enough to find automated,
permanent solutions to problems, rather than temporary fixes.
We offer a stimulating research environment with substantial
data and computing resources. A competitive salary and benefit package
will be
offered.
Please send your resume to Jeff Adams at jeff.adams@scansoft.com.
ASR Researchers Take New Positions
The STC Newsletter would like to provide announcements of
professors, researchers, and developers in the speech area
taking new positions. If you have moved lately or are in the
process of moving to a new position in the near future, send your
new contact
information to the STC Newsletter so it can be posted in the next
edition.
- In August 2005, Michiel Bacchiani moved from the IBM T.J.
Watson Research Center in Yorktown Heights, NY to Google in New York,
NY.
Google Inc.
1440 Broadway,
New York, NY 10018
michiel at google dot com
- On July 27, 2005, Roberto Pieraccini took the position of
CTO at Tell-Eureka (www.telleureka.com),
a Manhattan-based company specializing in spoken dialog systems for
selected vertical markets. He has left his previous position at the
IBM T.J. Watson Research Center in Yorktown Heights, NY.
robertopieraccini at yahoo dot com
Links to Upcoming Conferences and Workshops
(Organized by Date)
Auditory-Visual Speech Processing (AVSP 2005)
Vancouver Island, British Columbia, Canada, July 24-27, 2005
http://marcs.uws.edu.au/links/avisa/avsp05
SIGdial Workshop on Discourse and Dialog
Lisbon, Portugal, September 2-3, 2005
http://www.sigdial.org/workshops/workshop6
ITRW 2005 Workshop on DSP for IN-VEHICLE and MOBILE SYSTEMS
Sesimbra, Portugal, September 3, 2005
http://dspincars.sdsu.edu/
EUROSPEECH 2005 9th European Conference on Speech Communication
and Technology
Lisbon, Portugal, September 4-8, 2005
http://www.interspeech2005.org/
Disfluency in Spontaneous Speech
Aix-en-Provence, September 10-12, 2005
http://www.up.univ-mrs.fr/delic/Diss05
WNSP-2005 Workshop on Non-Linear
Speech Processing
Heraklion, Crete, Greece, September 20-23, 2005
http://www.ics.forth.gr/wnsp05/index.html
HLT/EMNLP 2005
Vancouver, B.C., Canada, October 6-8, 2005
http://www.hlt-emnlp05.org
IEEE WASPAA2005 Workshop on Applications of Signal Processing to
Audio and Acoustics
New Paltz, New York, October 16-19, 2005
http://www.LNT.de/~WASPAA05/
SPECOM 2005 - 10th International Conf. on Speech and Computers
Patras, Greece, October 17-19, 2005
http://www.wcl.ee.upatras.gr/specom2005.htm
IEEE ASRU2005 Automatic Speech Recognition and Understanding
Workshop
Cancun, Mexico, November 27 - December 1, 2005
http://www.asru2005.org
ICASSP2006
Toulouse, France, May 15-19, 2006
http://www.icassp2006.org
INTERSPEECH 2006 - ICSLP
Pittsburgh, PA, USA, September 17-21, 2006
http://www.interspeech2006.org/
MMSP-2006
Victoria, BC, Canada, October 3-6, 2006
http://research.microsoft.com/workshops/MMSP06
IEEE/ACL Workshop on SLT
Aruba, December 10-13, 2006
ICASSP2007
Honolulu, Hawaii, USA, April 17-20, 2007
INTERSPEECH 2007
Antwerp, Belgium, August 27-31, 2007
http://www.interspeech2007.org/