Bradley C. Lackey - Odenton MD, US Patrick J. Schone - Columbia MD, US Brenton D. Walker - College Park MD, US
Assignee:
The United States of America as represented by the Director, National Security Agency - Washington DC
International Classification:
G10L 15/06 G10L 15/08
US Classification:
704 8, 704 10, 704240, 704277
Abstract:
Method of recognizing phones in speech of any language. Acquire phones for all languages and a set of languages. Acquire a pronunciation dictionary, a transcript of speech for the set of languages, and speech for the transcript. Receive speech containing unknown phones. If the speech's language is unknown, compare it to the phones for all languages to determine the phones. If the language is known but no phones were acquired in that language, compare the speech to the phones for all languages to determine the phones. If phones were acquired in the speech's language but no corresponding pronunciation dictionary was acquired, compare the speech to the phones for all languages to determine the phones. If a pronunciation dictionary was acquired for the phones in the speech's language but no transcript was acquired then compare the speech to the phones for all languages to determine the phones.
Automatically Generating A Topic Description For Text And Searching And Sorting Text By Topic Using The Same
Douglas J. Nelson - Columbia MD Patrick John Schone - Elkridge MD Richard Michael Bates - Greenbelt MD
Assignee:
The United States of America as represented by the National Security Agency - Washington DC
International Classification:
G06F 1730
US Classification:
707531
Abstract:
A method of automatically generating a topical description of text by receiving the text containing input words; stemming each input word to its root form; assigning a user-definable part-of-speech score to each input word; assigning a language salience score to each input word; assigning an input-word score to each input word; creating a tree structure under each input word, where each tree structure contains the definition of the corresponding input word; assigning a definition-word score to each definition word; collapsing each tree structure to a corresponding tree-word list; assigning a tree-word-list score to each entry in each tree-word list; combining the tree-word lists into a final word list; assigning each word in the final word list a final-word-list score; and choosing the top N scoring words in the final word list as the topic description of the input text. Document searching and sorting may be accomplished by performing the method described above on each document in a database and then comparing the similarity of the resulting topical descriptions.