Pattern playback

The Pattern Playback [http://www.haskins.yale.edu/featured/patplay.html] [http://www.ling.su.se/staff/hartmut/kemplne.htm] is an early talking device built by Dr. Franklin S. Cooper and his colleagues, including John M. Borst and Caryl Haskins, at Haskins Laboratories in the late 1940s and completed in 1950. Several versions of the hardware were built, of which only one survives. The machine converts pictures of the acoustic patterns of speech, in the form of a spectrogram, back into sound. Using this device, Alvin Liberman, Franklin Cooper, and Pierre Delattre (later joined by Katherine Safford Harris, Leigh Lisker, and others) discovered acoustic cues for the perception of phonetic segments (consonants and vowels). This research had a revolutionary effect on speech science and was fundamental to the development of modern speech synthesis techniques, reading machines for the blind, and the study of speech perception and speech recognition.

To create sound, the Pattern Playback directs light from an arc source against a rotating disk with 50 concentric tracks whose transparencies vary systematically, so that the light passing through the tracks is modulated at 50 harmonics of a fundamental frequency. The modulated light is then projected onto a spectrogram whose reflectance corresponds to the sound pressure level of each partial of the signal. Finally, the light is directed towards a photovoltaic cell, which converts the variations in light into variations in sound pressure.
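The optical process described above amounts to additive synthesis: each harmonic is weighted by the darkness of the spectrogram at that frequency and time, and the weighted harmonics are summed. The following is a minimal software sketch of that idea, not a model of the original hardware. The function name `pattern_playback` and the default values for `f0`, `frame_dur`, and `sample_rate` are illustrative assumptions and are not taken from the description above; only the count of 50 harmonics comes from the text.

```python
import math


def pattern_playback(spectrogram, f0=120.0, frame_dur=0.01, sample_rate=16000):
    """Additive synthesis from a spectrogram (hypothetical helper).

    spectrogram[t][k] is the amplitude (0.0 to 1.0) of harmonic k + 1
    during time frame t. f0, frame_dur and sample_rate are assumed values.
    Returns a list of float samples scaled to the range -1.0 to 1.0.
    """
    samples_per_frame = int(round(frame_dur * sample_rate))
    nyquist = sample_rate / 2.0
    out = []
    for t, frame in enumerate(spectrogram):
        for i in range(samples_per_frame):
            # Absolute sample index keeps each harmonic phase-continuous
            # across frame boundaries, like a continuously spinning disk.
            n = t * samples_per_frame + i
            time = n / sample_rate
            value = 0.0
            for k, amp in enumerate(frame):
                freq = (k + 1) * f0
                if amp and freq < nyquist:  # skip silent or aliased partials
                    value += amp * math.sin(2.0 * math.pi * freq * time)
            out.append(value)
    # Normalize so the loudest sample has magnitude 1.0 (if any sound exists).
    peak = max((abs(x) for x in out), default=0.0)
    if peak > 0.0:
        out = [x / peak for x in out]
    return out


# Usage: a 0.1 s pattern in which only the 5th of 50 harmonics is painted.
pattern = [[1.0 if k == 4 else 0.0 for k in range(50)] for _ in range(10)]
samples = pattern_playback(pattern)
```

The returned samples could be written to a sound file, for instance with Python's standard `wave` module after conversion to 16-bit integers. Holding each frame's amplitudes constant for the whole frame is a simplification; a smoother result would interpolate amplitudes between frames.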

The Pattern Playback was last used in an experimental study by Robert Remez in 1976. The Pattern Playback now resides in the Museum at Haskins Laboratories in New Haven, Connecticut.

The term pattern playback now also refers, more generally, to algorithms or techniques for converting pictures such as spectrograms, cochleagrams, and correlograms back into sounds.

Digital pattern playback

In the 1970s, digital pattern playbacks began to supplant the earlier version. An early prototype developed at Haskins Laboratories combined a "Ubiquitous Spectrum Analyzer" for automatic spectral analysis with a VAX GT-40 display processor for graphic manipulation of the displayed spectrogram, a form of "synthesis by art", and subsequent re-synthesis using an "Ove hardware synthesizer." This hybrid hardware/software digital pattern playback was eventually replaced at Haskins Laboratories by the HADES software system, designed by Philip Rubin and implemented in Fortran on the VAX family of computers. A more modern version has been described by Arai and colleagues [http://yuichi.splab.ee.sophia.ac.jp/Digital_Pattern_Playback/].

See also

* Caryl Haskins
* Haskins Laboratories
* Alvin Liberman
* reading machine
* Robert Remez
* Philip Rubin
* spectrogram
* speech perception
* speech synthesis

Bibliography

Cooper, F. S., Liberman, A. M., & Borst, J. M., The interconversion of audible and visible patterns as a basis for research in the perception of speech. "Proceedings of the National Academy of Sciences", 1951, 37, 318-325.

Cooper, Franklin S., Delattre, Pierre C., Liberman, A. M., Borst, J. M. & Gerstman, L. J., Some experiments on the perception of synthetic speech sounds. "The Journal of the Acoustical Society of America", 1952, 24, 597-606.

Cooper, Franklin S., Some instrumental aids to research on speech. In "Report of the fourth annual round table meeting on linguistics and language teaching". Washington, D.C.: Institute of Languages and Linguistics, Georgetown University, 1953, 46-53.

J. M. Borst, The use of spectrograms for speech analysis and synthesis, "J. Audio Eng. Soc.", 4, 14-23, 1956.

Liberman, Alvin M., Some results of research on speech perception. "The Journal of the Acoustical Society of America", 1957, 29, 117-123.

Remez, Robert E., Adaptation of the category boundary between speech and nonspeech: A case against feature detectors. "Cognitive Psychology", 1979, 11, 38-57.

Malcolm Slaney, Pattern Playback from 1950 to 1995. "Proceedings of the 1995 IEEE Systems, Man and Cybernetics Conference". October 22-25, 1995, Vancouver, Canada.

Malcolm Slaney, Pattern Playback in the 90's, in "Advances in Neural Information Processing Systems 7", Gerald Tesauro, David Touretzky, and Todd Leen (eds.), MIT Press, Cambridge, MA, 1995.

T. Arai, K. Yasu and T. Goto, Digital pattern playback, "Proc. Autumn Meet. Acoust. Soc. Jpn.", 429-430, 2005.

T. Arai, K. Yasu and T. Goto, Digital pattern playback: Converting spectrograms to sound for educational purposes, "Acoust. Sci. & Tech.", 27(6), 393-395, 2006.


Wikimedia Foundation. 2010.
