On August 24, 2017, the Audio Analysis Lab’s annual workshop, the Audio Analysis Workshop, was held at AD:MT in Rendsburggade 14 in Aalborg. This year’s edition was sponsored by the Independent Research Fund Denmark via our project on diagnosis of Parkinson’s disease from voice signals. There were 25 participants from the lab, industrial partners, and universities abroad. The workshop featured 18 talks on ongoing research on topics such as frequency estimation, music signal modeling, distributed signal processing, Parkinson’s disease, noise reduction, hearing aids, array processing, and speech intelligibility prediction. This was the sixth edition of the workshop, which started in 2012 with the foundation of the Audio Analysis Lab.
On Sunday morning, August 20, Audio Analysis Lab members Jesper Kjær Nielsen, Jesper Rindom Jensen, and Mads Græsbøll Christensen gave a three-hour tutorial at INTERSPEECH in Stockholm, Sweden. The tutorial was entitled Statistical Parametric Speech Processing: Solving Problems with the Model-based Approach and included much of the lab’s work in recent years on model-based methods for processing of speech and audio signals. The tutorial was popular, with more than 70 participants signed up! More info about the tutorial can be obtained here. INTERSPEECH 2017 emphasizes an interdisciplinary approach covering all aspects of speech science and technology, spanning basic theories to applications.
Professor Barry Quinn will be visiting the Audio Analysis Lab this fall as Guest Professor. He is Professor of Statistics, Dept. of Statistics, Macquarie University, Sydney, Australia. In the signal processing community, he is well known for his fundamental work on autoregressive processes, frequency estimation and order estimation. While visiting Aalborg University, he will be working with the members of the Audio Analysis Lab on our various research projects and will be giving a Ph.D. course entitled The Estimation of Frequency. Course details can be found at https://phdcourses.dk/Course/54629.
We are extremely proud to announce that Audio Analysis Lab founding member Assistant Professor Jesper Rindom Jensen has been named Teacher of the Year by the Study Board of Media Technology. The award follows multiple nominations by students for his tireless efforts in the Media Technology B.Sc. program and the Sound & Music Computing M.Sc. program, where he both teaches courses and supervises student projects. The study board is responsible for the curricula and the quality assurance of the following programs: IT, Communication and New Media, Lighting Design, Medialogy, Service Systems Design, Sound and Music Computing.
We are pleased to announce that the Audio Analysis Lab will be giving a tutorial this year at Interspeech 2017. The tutorial is entitled Statistical Parametric Speech Processing: Solving Problems with the Model-based Approach and covers much of the lab’s research in present and past projects. Interspeech 2017 will be held in beautiful Stockholm, Sweden, August 20-24, and the tutorial will be held 9:30-12:00 on August 20. The tutorial will be given by Assistant Professor Jesper Rindom Jensen, Assistant Professor Jesper Kjær Nielsen, and Professor Mads Græsbøll Christensen. You can read more about the tutorial and the other tutorials here, and you can sign up at the Interspeech homepage here once the registration opens. Below, you can also find additional information about the tutorial.
Title: Statistical Parametric Speech Processing: Solving Problems with the Model-based Approach
Organizers: Jesper Rindom Jensen, Jesper Kjær Nielsen, and Mads Græsbøll Christensen
Abstract: Parametric speech models have been around for many years but have always had their detractors. Two common arguments against such models are that it is too difficult to find their parameters and that the models do not take the complicated nature of real signals into account. In recent years, significant advances have been made in speech models and robust and computationally efficient estimation using statistical principles, and it has been demonstrated that, regardless of any deficiencies in the model, the parametric methods outperform the more commonly used non-parametric methods (e.g., autocorrelation-based methods) for problems like pitch estimation. The application of these principles, however, extends way beyond that problem. In this tutorial, state-of-the-art parametric speech models and statistical estimators for finding their parameters will be presented and their pros and cons discussed. The merits of the statistical, parametric approach to speech modeling will be demonstrated via a number of well-known problems in speech, audio and acoustic signal processing. Examples of such problems are pitch estimation for non-stationary speech, distortion-less speech enhancement, noise statistics estimation, speech segmentation, multi-channel modeling, and model-based localization and beamforming with microphone arrays.
Audio Analysis Lab founder and head Mads Græsbøll Christensen was inducted into the Danish Academy of Technical Sciences (ATV), along with 39 other new members, on April 26, 2017, during the annual meeting in Copenhagen. You can read the press release here.
The Danish Academy of Technical Sciences (ATV) is an independent, member-driven think tank. ATV’s vision is that Denmark shall be one of five leading Science and Engineering regions in the world – to the benefit of future generations. In order to achieve this objective, ATV is undertaking a number of activities to the advantage of businesses, knowledge institutions and society as a whole. ATV has 800 members who are research directors, business executives, leading researchers and experts within their field.
In an effort to increase the visibility of the lab and our research, we have launched the Audio Analysis Lab YouTube channel! On the channel, we will post videos about our research, ongoing and past. The videos will be based on our presentations of papers given at conferences, Ph.D. defenses, etc., but will also include demos. The newly launched channel already features the following videos:
- Estimation of Multi-Pitch Signals in Stereophonic Mixtures
- Pitch Estimation for Non-Stationary Speech
- Localization of Sound in Reverberant Environments
- Statistical Parametric Speech Processing
Below you can see one of the videos and you can access the channel here or from the menu on the homepage.
The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017 is being held March 5-9, 2017 in New Orleans, USA. As usual, the Audio Analysis Lab is well represented at the top signal processing conference in the world with the following presentations:
- MODEL BASED BINAURAL ENHANCEMENT OF VOICED AND UNVOICED SPEECH
- LEAST 1-NORM POLE-ZERO MODELING WITH SPARSE DECONVOLUTION FOR SPEECH ANALYSIS
- PITCH-BASED NON-INTRUSIVE OBJECTIVE INTELLIGIBILITY PREDICTION
- DISTRIBUTED MAX-SINR SPEECH ENHANCEMENT WITH AD HOC MICROPHONE ARRAYS
- HARMONIC MINIMUM MEAN SQUARED ERROR FILTERS FOR MULTICHANNEL SPEECH ENHANCEMENT
- ESTIMATION OF MULTIPLE PITCHES IN STEREOPHONIC MIXTURES USING A CODEBOOK-BASED APPROACH
- FAST HARMONIC CHIRP SUMMATION
- GREEDY ALTERNATIVE FOR ROOM GEOMETRY ESTIMATION FROM ACOUSTIC ECHOES: A SUBSPACE-BASED METHOD
The Audio Analysis Lab and its activities were featured in BrainsBusiness’ newsletter in December. The article focuses on the research activities in signal processing for hearing aids and our research in voice analysis for diagnosis of Parkinson’s disease. You can read the article here. BrainsBusiness is a unique platform for ICT innovation in North Denmark through the interaction of industry and university and the link to public authorities. The overall aim of BrainsBusiness is to contribute to the North Denmark ICT cluster becoming recognised as one of the most attractive and competitive ICT clusters in Europe.
A PhD stipend is available at the Faculty of Engineering and Science (as of 1 January 2017, the Technical Faculty of IT and Design), Department of Architecture, Design and Media Technology, within the general study programme Electrical and Electronic Engineering. The stipend is open for appointment from 1 January 2017 or as soon as possible thereafter.
The position is with the research group Audio Analysis Lab. The PhD student will work on a research project entitled Signal Processing for Sound Zones.
Sound zones are spatially confined regions in which different audio contents can be enjoyed in an acoustic environment. Thus, sound zones replace headphones as a means of creating an individualized listening experience while also allowing for social interaction. There exist many potential applications of this concept, including home entertainment, museums, car cabins, and hospitals. Sound zones can be created using loudspeaker arrays (a number of loudspeakers organized in a geometry) by altering the phase and amplitude of the loudspeaker signals. The state of the art can, however, typically only achieve an attenuation of 10-15 dB of interfering sounds from other zones (depending on the setup), which means that the interference is clearly audible and annoying. In short, the concept, while promising, does not presently work well enough for most applications. This project aims at making high-quality sound zones feasible via advanced signal processing.
The successful applicant should have an M.Sc. (or equivalent) in engineering within signal processing. Prior experience with audio and acoustic signal processing is a plus but not required. Moreover, the successful applicant should be fluent in English, have strong programming and math skills, and be familiar with MATLAB (or similar tools). The applicant must submit his/her M.Sc. thesis (or a draft thereof) as part of the application. The degree must be completed at the time of the appointment.
The Audio Analysis Lab at Aalborg University conducts basic and applied research in signal processing theory and methods aimed at or involving analysis of audio signals. The research focuses on problems such as compression, analysis, classification, separation, and enhancement of audio signals, as well as localization, identification and tracking using microphone arrays. The lab and its members are currently funded by grants from the Villum Foundation, the Danish Council for Strategic Research, the Danish Council for Independent Research, and Innovation Fund Denmark. The research projects are carried out in close collaboration with leading industrial partners and universities around the world.
You may obtain further information concerning the scientific aspects of the stipend from Professor Mads Græsbøll Christensen, Audio Analysis Lab, Department of Architecture, Design and Media Technology, phone: +45 9940 9793, email: email@example.com.
PhD stipends are allocated to individuals who hold a Master’s degree. PhD stipends are normally for a period of 3 years. It is a prerequisite for allocation of the stipend that the candidate will be enrolled as a PhD student at the Technical Doctoral School of IT and Design, in accordance with the regulations of Ministerial Order No. 1039 of August 27, 2013 on the PhD Programme at the Universities and Certain Higher Artistic Educational Institutions. According to the Ministerial Order, the progress of the PhD student shall be assessed every six months. It is a prerequisite for continuation of salary payment that the previous progress is approved at the time of the evaluation.
The qualifications of the applicant will be assessed by an assessment committee. On the basis of the recommendation of the assessment committee, the Dean of the Faculty of Engineering and Science will make a decision for allocating the stipend.
For further information about stipends and salary as well as practical issues concerning the application procedure, contact Ms. Bettina Wedde, The Faculty of Engineering and Science, email: firstname.lastname@example.org, phone: +45 9940 9909.
You can read more and apply at http://www.stillinger.aau.dk/vis-stilling/?vacancy=875102.