INTERNATIONAL BUSINESS MACHINES CORP. (IBM)
Armonk, New York 10504
CONTENTS:
HISTORY
FORMANT SYNTHESIS USING DIPHONE CONCATENATION (1961-1985)
COMPUTER VOICE OUTPUT FROM STORED WORDS (1965-1971)
BIOGRAPHIES
------------------------------------------------------------- Top
HISTORY:
IBM started in 1914 as the Computing-Tabulating-Recording Co., a
conglomerate of companies, some dating back to the late 1800s.
Initial products were commercial scales, meat and cheese slicers,
industrial time-recording equipment, tabulators, and punched cards.
By 1924, the company name had been changed to International
Business Machines Corporation, and a line of business based on a
printing calculator had begun to dominate its sales. This line of
business developed into modern data processing.
Evolution of data systems from batch processing to interactive
systems stimulated interest in techniques for improving the man/
machine interface. One of these techniques was seen to be speech
synthesis for language translators, reservation systems, office
business systems, and aids for people with impaired vision.
------------------------------------------------------------- Top
PROJECT: FORMANT SYNTHESIS USING DIPHONE CONCATENATION (1961 - 1985)
The project was started at the IBM Research Laboratory in San
Jose, California, for the purpose of developing a practical speech
synthesizer as a computer output device. A synthesis approach
based on assembling words from stored diphone segments was chosen
because of the potential reduction in required computer storage
over storing whole words, and the expectation that segment
assembly would require less real-time processing than a synthesis
by rule method. Use of a separate analog synthesizer avoided the
computer processing necessary to construct the speech wave, as
complete simulations at that time were taking 25-times real time
on a large computer, an IBM 7090. Also, the function generator
and synthesizer offered immediate response, whereas complete
computer simulation would have required overnight processing.
IBM's use of online interactive processing with graphic displays
took another decade to develop.
Synthesis was initially seen merely as a method of insuring
smooth juncture between stored segments, impossible to do with
prerecorded human speech segments. Later understanding of the
Haskins Laboratories experiments (SSSHP USA Haskins Laboratories)
focused the work on using formant transitions between segments
as a major parameter of intelligibility. Formant synthesis,
rather than articulatory synthesis, was chosen because it was
thought the control data could be obtained from a sound spectro-
graph or an electronic formant-tracker.
The techniques and recordings of the project will be summarized in
the following three sections:
TASS-II (1961-1966)
Diphone synthesis from phonetic spelling in punched cards.
Formant synthesizer implemented with analog circuits and
operated from a 7-channel analog function generator or from
a digital computer via digital tape and D/A converter.
TASS-III (1967-1970)
Diphone synthesis from phonetic spelling, online diphone
creation and assembly on graphic display. Formant synthesizer
implemented with operational amplifiers and controlled by an
on-line computer with graphic display.
TASS-IV (1980-1985)
Formant synthesizer simulated and controlled by an on-line
computer with graphic display. Goal of text-to-speech using
diphone concatenation.
------------------------------------------------------------- Top
TASS-II, Terminal Analog Speech Synthesizer-II (1961-1966)
IBM Research Laboratory, San Jose, California
A cascade, or series, formant synthesizer operated from a multi-
channel function generator or from an IBM 7090 computer via an
off-line D/A converter (photos). The synthesizer was modeled after
the OVE synthesizers in Sweden (see SSSHP SWEDEN Royal Institute
of Technology) and the PAT synthesizer in England (see SSSHP UK
SRDE.)
In 1962 the first synthesizer, TASS-II , became operational with
the multi-channel function generator. Late in the year, additions
to the function generator allowed the patterns to be sampled at
10-millisecond intervals and punched into cards for entry into the
IBM 7090 computer. The stored values could then be assembled by
the computer in any specified order and the resulting values output
to a digital tape. Addition of a computer digital tape unit and a
D/A converter allowed the digital tape to control the speech syn-
thesizer.
Function Generator Synthesis. Speech synthesis from manually pre-
pared patterns of whole sentences, read by the function generator.
The control patterns for the function generator were constructed on
6-inch wide transparent plastic belts with 1/16-inch black masking
tape (developed for photoetching circuit boards), that could be
easily changed with scalpel and tweezers. The patterns were then
taped to a transparent, endless mylar belt to be carried past a
flying-spot scanner (photos). This function generator design was
believed to offer improvement in flexibility and ease-of-use over
the previous designs for multi-channel function generators using
conducting ink (see SSSHP UK University College London.)
A partial control pattern and sound sample (.wav, 67kB)
SSSHP 150.1 Tape: "IBM DIPHONE SPEECH SYNTHESIS (1961-1970)"
(syn: "Inventory Number..."; "Now this is Old...";
"We wish you a Merry Christmas...", 1, 2, and 3-part
harmony)
Cassette, good quality, copied from SSSHP 131, 135
Diphone Synthesis. The synthesizer control patterns were stored
digitally in the computer in short units, mainly as consonant-
vowel and vowel-consonant pairs. H.M. Truby coined the name
"Diphone" for these segments.
1963 Estes, S. E., H.R. Kerby, H.D. Maxey, and R.M. Walker,
"Speech synthesis from stored data", IBM J. Research and
Development, 8, 2-12 (1964). Description of synthesizer,
function generator, and diphone synthesis techniques.
(SSSHP 151 IBM Reprints) (B,I,K)
SSSHP 150.2 Tape: "IBM DIPHONE SPEECH SYNTHESIS (1961-1970)"
(7 diphone sen: "Which is his hat? ... eye for an eye.")
Cassette, good quality
1963 Maxey, H.D., "Terminal Analog Synthesis of Voiced Fricatives,"
J. Acoust. Soc. Am. 35, 1890 (A). Amplitude modulation of the
noise source to improve naturalness. Unpublished paper. (Copy
in SSSHP 151 IBM Reprints.) (K)
1964 Truby, H.M., "Duration as an alternate synthesis-parameter
for intensity and vowel-quality", Proc 5th Int. Congr.
Phonetic Sci. (Munster, 1964), Basel: Karger, 1965,
551-555. Description and naming of diphones. Effect of
vowel duration on perceived intensity and quality.
(SSSHP 151 IBM Reprints)
Diagnostic Intelligibility Tests. By 1965, it had become apparent
that familiarization with the synthetic material limited the
research staff's ability to make further improvements in the
synthesis. To solve that problem, formal intelligibility tests
were implemented, using naive listeners from a local university.
Initial test material was a synthetic version of the Central
Institute for the Deaf intelligibility test CID W22, using
phonetically-balanced words.
Analysis of the tests suggested which diphones needed to be
improved, and which diphones were being confused. By the end of
the year, intelligibility scores were in the range of 80% to 85%
of the intelligibility of human speech. This was one of the
first uses of intelligibility tests to diagnose problems with
speech synthesis.
1965 Dixon, N.R., "Some considerations in testing the intelligi-
bility of synthetic speech material", IBM Research Lab.,
San Jose, California, November 1965. IBM Confidential,
declassified Jan. 30, 1991. Use of CID-W22 lists in intelli-
gibility testing, effects of word and diphone familiarity,
techniques of diphone synthesis, test procedures, results.
(SSSHP 151 IBM Reprints)
Also presented as:
Dixon, N.R., H.D. Maxey and R. Aylsworth, "Some effects of
list familiarity and synthetic segment familiarity on the
intelligibility of PB-words produced by diphone synthesis,"
70th meeting of the Acoust. Soc. Amer. (St. Louis, MO, Nov.
3-6, 1965.)
SSSHP 150.3 Tape: "IBM DIPHONE SPEECH SYNTHESIS (1961-1970)"
(Definition of diphones & training words, 5 training
phrases, description of CID test material, 5 test
phrases)
Cassette, good quality, copied from SSSHP 134
Multiallophonic Diphone Synthesis. By 1965 it had become apparent
that diphones created in isolation were too stressed when used
within words and phrases. For that reason work during the years
1965 and 1966 addressed the use of diphones in multi-allophonic
roles. Diphones were synthesized initially in a meaningful word,
but stylized for subsequent use in other allophonic roles by later
modification of duration and intonation.
1966 Dixon, N.R., and H.D. Maxey, "Terminal analog synthesis of
continuous speech using the diphone method of segment
assembly", IEEE Trans. Audio and Electro., AU-16, 40-50
(1968). (I,K) Discussion of the theory of diphone
construction for multiallophonic roles in continuous speech.
Description of TASS-II system with function generator or IBM
7094 computer control. Test procedures and results of
intelligibility tests using 64 words from telephone-intercept
applications. "SWB" refers to the "Southwest Bell Telephone
sentences." (SSSHP 151 IBM Reprints)
A tape was prepared, narrated by N.R. Dixon, containing
examples of sentence synthesis from the function generator,
diphone words, and diphone sentences (diphone spelling).
SSSHP 150.4 Tape: "IBM DIPHONE SPEECH SYNTHESIS (1961-1970)"
("Inventory No. 901, Series A." (control variations);
10 words:"office, being, consult, ... W";
7 sen: "The number you dialed ... and dial again.")
Cassette, good quality, copied from SSSHP 135
SSSHP 32.18 Tape to accompany "Review of Text-to-speech con-
version for English," D.H. Klatt, JASA 82.3, Sept. 1987.
(5 sen: "The number you dialed ... and dial again")
Cassette, Klatt MIT A/D and D/A
ARTIFACTS AND RECORDS:
ARTIFACT: TASS-II synthesizer and function generator were
discarded after TASS-III became operational.
SSSHP 152 IBM TASS-II SPEECH SYNTHESIS SYSTEM CIRCUIT
DIAGRAMS. Original circuit diagrams and design notes
for TASS-II. Three-ring binder for 8.5" x 11" pages.
SSSHP 153 IBM TASS-II VOICING SPECTRUM
Calculations and measurements of TASS-II frequency
response and spectrum. Three-ring binder for 8.5" x
11" pages.
SSSHP 154 IBM SPEECH SYNTHESIS DIPHONE SEGMENT DATA
Development data for Diphone Libraries 1 to 5. Dated
Oct 30, 1962 to Jan 8, 1970. Diphone-assembly rules
for Lib 4. Three-ring binder for 8.5" x 11" pages.
SSSHP 155 IBM TASS-II ELECTRONIC CHARACTERIZATION (Index)
Photocopies of selected circuits and charts for use as
reproduction-masters.
SSSHP 156 IBM TASS-II PHOTOGRAPHS (and online)
SSSHP 157 IBM TASS-II FUNCTION GENERATOR PATTERNS (originals)
Original plastic patterns for:
Master for making blank Diazochrome patterns
"Inventory Number 901, Series A." (1965)
"We wish you a Merry Xmas and a Happy New Year." (1965)
"Now this is 'Old Ironjaw' talkin'." (1966)
SSSHP 158 IBM TASS-II FUNCTION GENERATOR PATTERNS (Diazochrome)
Contact copies on Diazochrome film of original plastic
pattern for:
"Calibration pattern" (1964)
"Speech Synthesis" and "IBM 1401" (1964)
"And a Happy New Year" (1965)
"We wish you a Merry Christmas" 1st Stanza (1965)
" " " " " 2nd Stanza (1965)
" " " " " 3rd Stanza (1965)
"We wish you a Merry Christmas and a Happy New Year (1965)
"Inventory 901, Series A" (1965)
SSSHP 159 IBM TASS-II FUNCTION GENERATOR PATTERN PHOTOCOPIES
Photocopies of several function generator patterns on
acid-free paper. Contours on the older photocopies have
been darkened by hand for better reproduction. The long
patterns required up to five photocopies, in which the
photocopies overlap. Note that the first two patterns
have different F0 and F3 scales. (Originals of the last
three are in SSSHP 157.)
37. Notes on original function generator patterns.
34A. "Hello, how are you." - 1962
34B. "Mary had a little lamb." - 1962
34C-E. "Inventory Number 901, Series A." - 1965
34F-K. "We wish you a Merry Xmas and a Happy New Year."
- 1965
34L. "Now this is 'Old Ironjaw' talkin'." - 1966
------------------------------------------------------------- Top
TASS-III, Terminal Analog Speech Synthesizer-III (1967-1970)
At the end of 1966, N.R. Dixon and H.D. Maxey moved the project to
the IBM Advanced Technology Department, IBM Corp., Research Triangle
Park, NC, for further development as a possible product. The library
of diphone segments for TASS-II (Library 4) was not preserved sepa-
rately, but was migrated into the TASS-III Diphone Library 5 (see
below.)
TASS-III was an evolution of the TASS-II synthesizer to provide for
direct digital control by the computer, circuit designs better suited
for products, and enhancements to naturalness of the synthetic
speech. The calibration of the TASS-II synthesizer was preserved
as much as possible so that the diphone library from TASS-II
could be converted for use with TASS-III. Sufficient data has
been included with this history to allow diphone synthesis to
be reconstructed by simulation.
TASS-III was implemented as an analog formant synthesizer, built
with operational amplifiers, but operated digitally from direct
attachment to an IBM S/360 computer. A graphics display replaced
the previous multi-channel function generator.
During this period, N.R. Dixon was able to improve the method of
diphone segmentation and assembly, but text-to-speech capability
was not reached. The speech synthesis project was terminated
during 1970 because of insufficient commercial application to
continue the funding.
SSSHP 150.5 Tape: "IBM DIPHONE SPEECH SYNTHESIS (1961-1970)"
("Speech Processing..."; "Sock it to..."; "An heah...";
Schlitz Beer song; Japanese: "Ohayoo...", "Dr. Miura...",
"O Genki..."; German: "Spraken ze...")
Cassette, good quality, copied from SSSHP 138, 139
1966 Dickerson, J.A., A Digitally Controlled Formant Generator for
a Terminal Analog speech Synthesizer, Technical Report
TR29.0043, IBM Systems Development Division, Research
Triangle Park, North Carolina, March 1969, 9 pp. Design and
circuit diagram for the TASS-III prototype.
(SSSHP 151 IBM Reprints)
1970 Maxey, H.D., "Terminal-Analog Synthesis System III", Speech
Processing Technology, IBM Research Triangle Park, NC., May,
1970. Selected pp. 25-60, all figures and tables from unpub-
lished draft. Design considerations for TASS-III. Detailed
description of TASS-III system with circuit transfer functions
(s-plane equations), calibration values, and a description of
the sound room used for testing. (SSSHP 151 IBM Reprints)
RECORDS:
TASS-III DIPHONE DATA
- Index: 631 diphones, 50 words (ss_ibmdl.txt 19KB)
- Rules for Diphone Sentences (ss_ibmdr.txt 10KB)
- IBM Diphone-Synthesis Sentences (49) (ss_ibmds.txt 44KB)
- Diphone Library 5B data (ss_ibm5b.zip 52KB)
SSSHP 160 Computer Diskette: IBM TASS-III DIPHONE DATA
IBM PC 1.44M diskette, formatted by IBM DOS 7.0
- Index: 631 diphones, 50 words (ss_ibmdl.txt 19KB)
- Rules for Diphone Sentences (ss_ibmdr.txt 10KB)
- IBM Diphone-Synthesis Sentences (49) (ss_ibmds.txt 44KB)
- Diphone Library 5B data (ss_ibm5b.txt 197KB)
SSSHP 161 IBM TASS-III DIPHONE LIBRARY 5B - printed
Printed on acid-free paper:
- Index: 631 diphones, 50 words 5 pp.
- Rules for Diphone Sentences 4 pp.
- IBM Diphone-Synthesis Sentences (49) 13 pp.
- Reconstruction Notes 1 p.
- Diphone Library 5B data 30 pp.
If necessary within about the next 200 years, the esti-
mated life of the paper, the electronic diphone library
5B can be reconstructed by scanning the printed pages
and using optical character recognition (see included
Reconstruction notes.)
SSSHP 162 IBM TASS-III ORIGINAL CIRCUIT DIAGRAMS
Original circuit diagrams and design notes for TASS-III.
SSSHP 163 IBM TASS-III ELECTRONIC CHARACTERIZATION (Index)
Photocopies of selected circuits and charts for use as
reproduction-masters.
SSSHP 164 Programs: "DOS Speech Synthesis Assembly Listings,
Copy 2, 31 Aug 72", F. Easterly, 12"x15", IBM. Binder
of computer listings of source code for S/360 assembly
language programs for the IBM S/360 M40 DOS computer.
Programs operated the 2250 graphics display, assembled
and modified diphones, and output control data to the
IBM TASS-III synthesizer. Diphone-assembly logic starts
on leaf 17 for 12 pages.
SSSHP 149 IBM - Maxey Tape Collection, Card Index and Tape
Transcription Notes
------------------------------------------------------------- Top
TASS-IV: TEXT-TO-SPEECH USING DIPHONE CONCATENATION (1980-1985)
IBM Thomas J. Watson Research Center, Yorktown Heights, New York.
A research project to revive the diphone synthesis work after ten
years. The goal was a completely simulated text-to-speech system.
N. R. Dixon, by then at the T. J. Watson Research Center, contrib-
uted the Diphone Library 5B of 11/11/70, a simulation of the
TASS-III speech synthesizer, and various experimental programs.
No record available of text-to-speech. (Letter of permission from
IBM in SSSHP USA IBM file.)
1984 Klavans, J., J. Nartey, C. Pickover, D. Reich, M.B. Rosson,
and J. Thomas, "Walrus: High-Quality Text-To-Speech Research
System", Proc. IEEE Speech Synthesis and Recognition, pp.
19-28, 1984.
1985 Pickover, Clifford A., TUSK: A Versatile Graphics Workstation
for Speech Research, RC 11497, IBM Thomas J. Watson Research
Center, Yorktown Heights, NY, 46 pp., 1985. Experimental
speech synthesis system description, diphone assembly
algorithms. (Copy in SSSHP 151 IBM Reprints.)
SSSHP 117 Tape: "IBM Diphone Speech Synthesis, WALRUS System,
circa 1984."
("To be or not to be", var; "My name is David", voice
var; Hallelujah chorus, var)
Cassette, good quality
1985 Reich, David L., A WALRUS COOKBOOK, IBM T. J. Watson Research
Center report RC 11161, 12 pp., 1985. User's guide for develop-
ment text-to-speech system. Synthesis using an improved diphone
segment library and algorithms. Marked "IBM Internal use only",
but permission granted to Smithsonian. (Copy in SSSHP 151 IBM
Reprints.)
------------------------------------------------------------- Top
PROJECTS: COMPUTER VOICE OUTPUT FROM STORED WORDS (1965 - 1971)
Devices for modifying and assembling stored words to form messages
for computer voice output.
IBM Voice Answer-Back System
Messages assembled from words stored on a magnetic drum and output
simultaneously to 200 telephone lines. Words time compressed to
either 300 or 600 ms. Installed at the New York Stock Exchange.
1965 DuBois, G.E., "Audio Response for the New York Stock
Exchange," IEEE Conv. Record, March, 1965.
Tape ?
IBM 7770 Audio Response Unit
Commercial version of Voice Answer-Back System. Word time slots
of 500 ms on a magnetic drum, 128 words maximum, available to 48
telephone lines simultaneously. Human speech was manually edited
to fit portions of words or phrases into the fixed 500 ms time
slots.
1971 Chapman, W.D., "Techniques for computer voice response",
IEEE Intern. Conv. Record, Paper 2F.2, 98-99 (1971) (B)
SSSHP 140 Tape: "IBM 7770, 1969", 5" reel, 7.5 ips. Donated
by IBM Corp. (Maxey Tape T69.6)
(samples of digitized human speech for from IBM 7770-III
for NY Bell Telephone Co.)
IBM 7772 Audio Response Unit
Pitch excited channel vocoder controlled by a computer. One
synthesizer per telephone line. 1000-word business vocabulary of
processed human speech.
1967 Buron, R., "Generation of a 1000 word vocabulary for a pulse
excited vocoder operating as an audio response unit", 1967
Conf. on Speech Comm. and Proc, MIT, Cambridge MA, Session
A1, Nov 6-8, 1967, and IEEE Trans on Audio and Electro, Vol.
AU-16, No. 1, 21-25, March 1968.
SSSHP 81.6c Tape: "Speech Analysis/Synthesis Demonstration,
Copy No. 2-9, T67.2"
(male syn:"Today is Tue, Nov 7. One, two, ...,six.")
(fem syn:"One, two, ..., six.", French phrase)
7" reel, good quality
------------------------------------------------------------- Top
BIOGRAPHIES
RAY AYLSWORTH
1957 B.S. in electrical engineering, Texas Technological College,
Lubbock TX
1959 S.M. in electrical engineering, Massachusetts Institute of
Technology, Cambridge MA
Advanced Technology Dept., IBM General Products, San Jose CA,
photoconductor and magnetic material studies
1960 Research and Engineering Div., Lockheed Missiles and Space
Co., Sunnyvale CA ?, telecommunications systems
1963 E.E. in electrical engineering, Stanford University, Palo
Alto CA
1964 Communication Sciences Dept., IBM San Jose Research
Laboratory, San Jose CA, speech synthesis
RENÉ BURON
1959 Engr. Degree, École Polytechnique, Paris
1961 Centre d'Etudes et Recherches, IBM World Trade Corp.,
La Gaude, France
WILLIAM D. CHAPMAN
1954 B.S.E. in electrical engineering, Univ. of Michigan, Ann
Arbor MI
1956 M.S.E. in electrical engineering, Univ. of Michigan
1959 M.S. in communications sciences, Univ. of Michigan
Communications Sciences Laboratory, Univ. of Michigan, speech
analysis and processing
1961 IBM Corp., Poughkeepsie NY, speech recognition, speaker
verification, computer voice output
1965 Mgr., IBM Advanced Technology Speech Processing Dept.,
Research Triangle Park NC, speech synthesis and recognition,
speaker verification, computer voice output
19 Deceased
JACK A. DICKERSON
1956 Diploma, Electronics Institute of Technology, Detroit, MI
1956 IBM, Poughkeepsie, NY
1966 IBM, Research Triangle Park, NC, speech synthesizer design
1990 Retired from IBM
N. REX DIXON
1958 B.A. in speech pathology and psychology, Western Michigan
University, Kalamazoo MI
1960 M.A. in speech pathology and audiology, Indiana University,
Bloomington IN
1960-61 Cornell University, Ithaca NY, teacher of anatomy and
physiology of speech and hearing
1963 Communication Sciences Dept., IBM San Jose Research,
San Jose CA, speech synthesis
1966 Ph.D. in speech and hearing sciences, Stanford University,
Palo Alto CA
1967 IBM Advanced Technology Speech Processing Dept., Research
Triangle Park NC, speech synthesis and speech recognition
1972 IBM Thomas J. Watson Research Center, Yorktown Heights, NY,
speech recognition and speech synthesis
1983 IBM Journal of Research and Development, Armonk, NY, Editor
following year
1989 retired from IBM
G. E. DUBOIS
W. FRED EASTERLY, JR.
1950-51 Bob Jones University, Greeneville, SC
1951-56 Teletype maintenance, U.S. Air Force
1956-57 Cours de Civilization Francaise, Sorbonne, France
1960 Diploma, Radio Electronics Television School, Detroit, MI
1960 IBM, Poughkeepsie, NY
1966 IBM Advanced Technology Speech Processing Dept, Research
Triangle Park, NC
1991 Retired
SAMUEL E. ESTES
1957 B.S. in electrical engineering, Texas Technological College
1958 S.M. in electrical engineering, Massachusetts Institute of
Technology
1958 Sandia Corporation, Albuquerque, NM, automatic data collection
1959 IBM Advanced Systems Development, San Jose, CA, digital data
transmission over telephone networks
1960 IBM San Jose Research, San Jose, CA, magnetic recording
theory, speech synthesizers, speech recognition
1965 Ph.D. in electrical engineering, Stanford University
1965 IBM T.J. Watson Research Laboratory, Yorktown Heights, NY
1968 IBM, Research Triangle Park, NC
1972 IBM Thomas J. Watson Research Laboratory, Yorktown Heights, NY
1976 Deceased
H. RAY KERBY
1957 B.S. in electrical engineering, Texas Technological College,
Lubbock, TX
1957 IBM San Jose Research Laboratory, San Jose, CA, digital magnetic
recording, speech synthesis, photodigital memory
1960 M.S. in electrical engineering, Stanford University, Palo
Alto, CA
1967 Manager, IBM Advanced Technology Dept., Research Triangle Park, NC
. . .
1993 Retired from IBM
JUDITH L. KLAVANS
1968 B.A., Spanish & Mathematics, Oberlin College
1976 M.A., Linguistics, University of London
1980 Ph.D., Linguistics, University of London
1980 Post-doctoral Fellow, Massachusetts Institute of Technology
1983 Research Staff Member, IBM T. J. Watson Research Center,
Yorktown Heights, NY
1993 Research Scientist, Computer Science Department, Columbia
Univ., New York
1995 Dir., Center for Research on Information Access, Columbia Univ.
H. DAVID MAXEY
1957 B.S. in electrical engineering, Texas Technological College,
Lubbock, TX
1957 IBM San Jose Research Laboratory, San Jose, CA, digital magnetic
recording, speech synthesis
1960 M.S. in electrical engineering, Stanford University, Palo
Alto, CA
1967 IBM Advanced Technology Dept., Research Triangle Park, NC,
speech synthesis
1970-71 Fellow, Advanced Study Program, Massachusetts Institute of
Technology, Cambridge, MA
1971 IBM Systems Development Laboratory, Research Triangle Park
NC, performance analysis of computer communication systems
1986 Collaborator, Smithsonian Institution, technical direction of
Smithsonian Speech Synthesis History Project
1991 retired from IBM
JONAS NARTEY
CLIFFORD A. PICKOVER
1978 B.A., Biology, Franklin & Marshall College, Lancaster, PA
1982 Ph.D., Molecular Biophysics and Biochemistry, Yale University
1982 Research Staff Member, IBM T. J. Watson Research Center
DAVID L. REICH
MARY BETH ROSSON
1977 B.A., Psychology, Trinity University, San Antonio, Texas
1977 Statistician, Southwest Research Institute, San Antonio, Texas
1981 Human Factors Psychologist, Human Factors Dept., IBM Austin
1982 Ph.D., Human Experimental Psychology, University of Texas, Austin
1982 Computer Science Dept., IBM T.J. Watson Research Center
1994 Assoc. Prof., Computer Science Dept., Virginia Polytechnic
Institute and State University
JOHN C. THOMAS
1967 B.A., Education, Case-Western Reserve University
1967 Doctoral student, University of Michigan
1971 Ph.D., Experimental Psychology, University of Michigan
1971 Research Assoc., Harvard Medical School and Mass. General Hospital.
1973 Research Staff Member, IBM T.J. Watson Research Center,
office systems evaluation
1980 Staff, Chief Scientist's Office, IBM Corporation
1982 Project Manager, IBM T.J. Watson Research Center, speech
synthesis
1986 Dir., Artificial Intelligence Lab, NYNEX Science & Technology
1994 Executive Director, NYNEX Science & Technology
1997 Executive Director, Bell Atlantic Science & Technology
1998 Manager, Knowledge Socialization, IBM T.J. Watson Research
Center
H. M. TRUBY
1941 B.S. and B.A., University of Texas, Austin TX
1949 M.S. and M.A., University of Wisconsin
1952-54 Research Associate, Haskins Laboratories, NY
1954-55 Visiting Lecturer in Linguistics, Kiel University
1955-60 Research Associate, Speech Transmission Laboratory, Royal
Institute of Technology, Stockholm
1956-62 Lecturer, University of Stockholm
1957-62 Research Associate, Wenner-Gren Cardiovascular Research
Foundation
1959 Ph.D. in phonetics, University of Lund, Sweden
1961-62 Principal Investigator, United Cerebral Palsy grant
1962-65 IBM San Jose Research, San Jose CA, speech synthesis and
speech recognition
. . .
1993 Deceased, South Miami, FL
ROBERT M. WALKER
19 radio engineer, Seattle WA, directional antenna designs
1942 MIT Radiation Laboratory, radar displays
1946 IBM Watson Scientific Computing Laboratory, Columbia Univ.,
New York, NY, analog computing, computer electronics
1960 Manager, Communication Science Dept., IBM San Jose Research,
San Jose, CA, speech synthesis and recognition
1967 Deceased, Monte Sereno, CA
------------------------------------------------------------- Top
REVIEWED BY:
N. Rex Dixon
H. David Maxey, SSSHP Editor
IBM Corp., retired
|