1 / 43

Jonas Braasch Communication Acoustics and Aural Architecture Research Laboratory (C A 3 R L)

An automated calibration system for telematic music applications. Jonas Braasch Communication Acoustics and Aural Architecture Research Laboratory (C A 3 R L) Rensselaer Polytechnic Institute, Troy, NY http://symphony.arch.rpi.edu/~carl. Steinberg and Snow (1934). Location A. Location B.

fayola
Download Presentation

Jonas Braasch Communication Acoustics and Aural Architecture Research Laboratory (C A 3 R L)

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. An automated calibration system for telematic music applications Jonas Braasch Communication Acoustics and Aural Architecture Research Laboratory (CA3RL) Rensselaer Polytechnic Institute, Troy, NY http://symphony.arch.rpi.edu/~carl

  2. Steinberg and Snow (1934) Location A Location B

  3. Biggest Challenges • Bandwidth • Transmission latency • Feedbacks • Communication during setup. …you will need to make friends with the sysadmin

  4. Necessary Bandwidth • Telematic Circle • The transmission of DV quality video requires a bandwidth of 25 Megabits/s • 8 channels of CD quality audio about 5.5 Megabits/s. • McGill University (Jeremy Cooperstock) • Gigabit/s AV connection for HD uncompressed • Fraunhofer-Erlangen • New Fraunhofer/Erlangen low-latency coder AAC-LD (about 5 ms for the coding/decoding process), will be integrated into CISCOs’ system.

  5. Transmission Delay • Acceptable latency for music: 25 milliseconds. • Speed of light • Signal traveling between RPI in Troy, NY and CCRMA at Stanford University, Palo Aalto, CA: 14 ms for the distance of 4,111 km (direct line). • 54 ms for a connection between New York and Australia (16,000 km) • Speed of sound • 14 ms=6 m (c=430 m/s at room temperature) • Total delay • transmission delay (determined by physical distance propagation speed of the signal) • signal-processing delay

  6. Feedback loop in PA systems

  7. Feedback loop in telematic connections Longer transmission delays are easier to detect audibly!

  8. Echo feedback • Audible colorations and echoes are a common side effect in two-way transmission systems. • Audio/videoconferencing systems such as iChat or Skype use echo-cancellation systems to suppress feedbacks. • In speech communication echo-cancellation systems work well, since the back-and-forth nature of spoken dialogue usually allows to temporarily suppress the transmission channel in one direction. • In simultaneous music communication, however, this procedure tends to cut-off part of the performance. • Solution: capture music signals with closely spaced microphones (e.g., lavalier microphones).

  9. Telematic Circle Members • Deep Listening Institute • Pauline Oliveros, Sarah Weaver • Rensselaer Polytechnic Institute • Curtis Bahn, Jonas Braasch, Pauline Oliveros • Stanford University (CCRMA) • Chris Chafe, Ge Wang • University of California San Diego • Mark Dresser, Shahrokh Yadegari, Adriene Jenik, Victoria Petrovich • McGill University • Jeremy Cooperstock, Bill Martens

  10. Telematic Circle Concerts • March 22: RPI, Northwestern, Stanford • June, 26: ICAD 2007, Montreal • McGill, Korea, RPI, Stanford • Aug 6-7, SIGGRAPH 2007, San Diego • UCSD, RPI, Stanford/Banff • Nov 16, 2007: Stanford, RPI, UCSD • Dec 14, ISIM 2007: RPI, Northwestern • Aug 28, ICMC 2008, RPI, Belfast, Stanford

  11. Telematic transmission scheme

  12. from EMPAC webpage

  13. Problem • Automatic tracking of actors required during a theater film shoot • Typically, the actors are recorded closely with lavalier microphones + good dry sound quality  Spatial aspects of the recordings are lost and need to be recovered

  14. Possible Solutions to track actors • Optical tracking: works well but can fail in bright stage light • GPS: only works outdoors • Electromagnetic tracking (e.g., via sender for lavalier mic): works only outdoors (reflections occur indoors) • Acoustic tracking: works well for single sound sources, problematic in multiple sound source scenarios

  15. Acoustic Tracking Solutions • Beamforming • Methods based on interchannel delay • Both methods work only well for single sound sources • Need to find time-frequency windows in which only one sound source is present. • Lavalier microphone data can be used for this purpose.

  16. Sketch of the recording and reproduction set-up ViMiC=Virtual Microphone Control (Multichannel sound spatialization software)

  17. Microphone Array • Microphone array was positioned in the center of the room, 186 cm above the ground. • consists of 5 omni-directional microphones (Earthworks M30). • square-based pyramid dimensions: base side: 14 cm, triangular side 14 cm.

  18. SNR Estimation

  19. Estimation of the signal-to-noise ratios for each sound source

  20. Test material • Material recorded during a theatre production. • Room acoustics was typical for a small theatre and not reverberation free. • Four actors were equipped with lavalier microphones.

  21. Analyses methods • programmed in MATLAB • A filter bank of five octave-band wide IIR filters (Chebyshev Type-I filters, 125 Hz to 2 kHz center frequencies. • A running time-window was applied (Hanning, 100-ms filter length). For each time/frequency bin, the signal-to-noise ratio was determined according to Eq. 1. • Sound source estimation (if SNR > 4 dB) • cross-correlation technique • information theoretic delay criterion (ITDC) algorithm [Mod:88]

  22. DOA for the left/right angle with the speed of sound c, the sampling frequency fs, the internal delay , and the distance between both microphones d DOA=Direction of Arrival

  23. Results for cross-correlation technique

  24. Results for cross-correlation technique

  25. Results for ITDC

  26. Results for ITDC

  27. Conclusions • Initial results promising. • Significant improvement was observed when the cross correlation method was replaced with the information theoretic delay criterion algorithm. • A real-time application will be implemented in future

  28. Literature • [Fri:80] Fritsch, F. N. and R. E. Carlson, Monotone Piecewise Cubic Interpolation, SIAM J. Numerical Analysis, Vol. 17, 1980, pp.238–246 • [Jot:92] Jot, J.-M. (1992) Étude et réalisation d'un spatialisateur de sons par modèls physiques et perceptifs, Doctoral dissertation, Télécom Paris. • [Mod:88] R. Moddemeijer, An information theoretical delay estimator, Ninth Symp. on Information Theory in the Benelux, May 26–27, 1988, Mierlo (NL), pp. 121–128, Ed. K.A. Schouwhamer Immink, Werkgemeenschap Informatie- en Communicatietheorie , Enschede (NL). • [Pul:97] Pulkki, V. (1997) Virtual sound source positioning using vector base amplitude panning}, J. Audio Eng. Soc. 45, 456–466. • [Wür:97] W. Würfel (1997) Passive akustische Lokalisation [passive acoustical localization], Master's Thesis, Technical University Graz.

  29. Outlook • Expand telematic Circle (contact: braasj@rpi.edu) • Integration of haptics into the transmission system • Get Acoustic Tracking system to work in real-time

  30. Room Model

  31. AVE Architecture

  32. AV integration Valente & Braasch, Acustica 2008

  33. virtual sound source virtual microphones floor

  34. Blumlein XY-Technique Signal left channel Signal right channel Angle of arrival Directivity pattern right microphone Directivity pattern left microphone

  35. Blumlein XY-Technique Signal left channel Signal right channel –45° Angle of arrival Directivity pattern right microphone Directivity pattern left microphone

  36. Blumlein XY-Technique Signal left channel Signal right channel –45° +45° Angle of arrival Directivity pattern right microphone Directivity pattern left microphone

  37. Blumlein XY-Technique Signal left channel Signal right channel –45° 0° +45° Angle of arrival Directivity pattern right microphone Directivity pattern left microphone

  38. Stereophony >> l=-30° r=30° a a

More Related