kevin knight's home page
kevin knight's home page
kevin knight
usc/information sciences institute,
4676 admiralty way,
marina del rey, ca 90292,
(310) 448-8716,
e-mail:
senior research scientist and fellow, information sciences institute (isi).
research associate professor at the university of southern california, department of computer science.
research interests include: artificial intelligence, natural language
processing, semantics, machine translation, unification, machine learning,
heuristic search, automata theory, decipherment.
my publications.
my vita.
i'm involved in natural language research at usc/isi, where
we have a range of
projects.
i think our approach to syntax in machine translation is best
described in d. barthelme's short story
they called for more structure....
those interested in the syntax decoder search algorithm should listen to
an answer can be found.
you can download
tiburon,
a new tree toolkit developed by jonathan may. check it out!
you can also download jonathan graehl's very fine
finite-state string toolkit
carmel.
here's a paraphrase corpus aligned
by tagyoung chung. two translations of "20,000 leagues under
the sea" are manually matched up at the paragraph level.
corpus 1.
corpus 2.
correspondences.
the correspondence file comes in two columns; the first column
locates a line in corpus 1, and the second column locates
a line in corpus 2.
i am teaching this fall
cs562
(empirical methods in natural language processing), which i have
previously taught in fall 2000, spring 2002, spring 2003,
fall 2003, fall 2004, and fall 2006.
my
laptop
is a big mess.
i recently said this out loud: "i want a four-door convertible like the
one lincoln was assassinated in."
check out these
slides
from a talk i gave on the voynich manuscript.
i have been fortunate to work with great students. phd graduates include
yaser al-onaizan (whose thesis was on named-entity translation,
and who is now a researcher at ibm t. j. watson), kenji
yamada (whose thesis was on syntax-based statistical machine
translation, now at language weaver),
irene langkilde-geary (whose thesis was on language generation,
now at university of brighton), and
philipp koehn (whose thesis was on noun phrase
translation and is now on the faculty at university of
edinburgh). i also supervised ishwar chander's phd thesis on
statistical article generation. after that, i
co-advised shou-de lin's phd (now on the faculty at
the national university of taiwan), and bryant huang
did an excellent masters thesis on syntax-structuring for translation.
i recently went to denmark. here's me in
denmark.
here's a story called
little house
on the west bank. no matter how readers interpret the text,
my favorite part is still where the baby throws up on the mom.
here is a
slide show
that explains the global situation to american people.
i'm possibly the least-skilled person at drawing ever to get paid for it,
for example:
actual conversations ...
strange things ...
parking structures ...
werld maps.
fortune-telling
cards.
a machine
translation glossary for those in the field.
my idol is fred jelinek.
here's a big and free
french/english parallel corpus
ulrich germann built.
don't click here.
these are my kids
angela
and
victoria,
at least as they were several years ago when i still knew
how to work the photo scanner. i like how babelfish translates
this part of my home page to spanish: "estos son mis cabritos
angela and victoria..." ("these are my little goats...")
i wrote a
book
for little kids, and another
book
for big kids.
i used to be an
amber freak, a long time ago, and kept myself
amused.
since then, three of my favorite writers are philip k. dick, jack womack,
and jim thompson. i've read thompson's pop 1280 a few times in a row now.
that is a god-danged good book. other favorites: a scanner
darkly, man in the high castle, confessions of a crap artist, puttering
about in a small land, now wait for last year, the penultimate truth,
ambient, terraplane, heathern, elvissey, random acts of senseless
violence, and let's put the future behind us. i like also don delillo,
david lodge, kurt vonnegut, and j. d. salinger. seems like the first
couple of books are always better than all the rest.
i like bob dylan's music and found his life/autobiography (chronicles)
strangely similar neil armstrong's (first man). also: tao te ching,
and benjamin tucker's instead of a book.
selected publications
(well, ok, i went ahead and selected them all)
2007
"syntactic re-alignment models for machine translation",
(j. may and k. knight), proc. emnlp-conll, 2007.
get paper in pdf.
"what can syntax-based mt learn from phrase-based mt?"
(s. deneefe, k. knight, w. wang, d. marcu),
proc. emnlp-conll, 2007.
get paper in pdf.
"binarizing syntax trees to improve syntax-based machine translation accuracy,"
(w. wang, k. knight and d. marcu),
proc. emnlp-conll, 2007.
get paper in pdf.
2006
"statistical syntax-directed translation with extended domain of
locality" (l. huang, k. knight, a. joshi), proc. amta (poster), 2006.
get paper in pdf.
"tiburon: a weighted tree automata toolkit" (j. may and k. knight),
proc. international conference on implementation and application
of automata (ciaa), lecture notes in computer science
(springer), v. 4094/2006, 2006.
get paper in pdf.
"building an english-iraqi arabic machine translation system
for spoken utterances with limited resources" (j. riesa,
b. mohit, k. knight, d. marcu), proc. interspeech, 2006.
get paper in pdf.
"unsupervised analysis for decipherment problems" (k. knight,
a. nair, n. rathod, and k. yamada), proc. acl-coling (poster), 2006.
get paper in pdf.
"scalable inference and training of context-rich syntactic models"
(m. galley, j. graehl, k. knight, d. marcu, s. deneefe, w. wang,
and i. thayer), proc. acl-coling, 2006.
get paper in pdf.
"discovering the linear writing order of a two-dimensional
ancient hieroglyphic script" (shou-de lin and kevin knight),
artificial intelligence, volume 170(4-5), 2006.
get draft in pdf.
"a better n-best list: practical determinization of weighted
finite tree automata",
(j. may and k. knight), proc. naacl-hlt, 2006.
get paper in pdf.
"capitalizing machine translation",
(w. wang, k. knight, d. marcu), proc. naacl-hlt, 2006.
get paper in pdf.
"synchronous binarization for machine translation",
(h. zhang, l. huang, d. gildea, k. knight), proc. naacl-hlt, 2006.
get paper in pdf.
"relabeling syntax trees to improve syntax-based machine
translation quality",
(b. huang and k. knight), proc. naacl-hlt, 2006.
get paper in pdf.
2005
"transonics: a practical speech-to-speech translator for
english-farsi medical dialogs" (robert belvin, emil ettelaie,
sudeep gandhe, panayiotis georgiou, kevin knight, daniel marcu,
scott millward, shrikanth narayanan, howard neely, david traum),
proc. acl poster/demo, 2005.
"interactively exploring a machine translation model"
(steve deneefe, kevin knight, and hayward h. chan),
proc. acl poster/demo, 2005.
get paper in pdf.
"machine translation in the year 2004", (k. knight, d. marcu),
proc. icassp, 2005.
get paper in pdf.
"an overview of probabilistic tree transducers for natural
language processing", (k. knight, j. graehl),
proc. of the sixth international conference on intelligent
text processing and computational linguistics (cicling),
lecture notes in computer science,
copyright springer verlag, 2005.
get
paper in postscript.
get
paper in pdf.
2004
"text simplification for information seeking applications",
(b. beigman klebanov, k. knight, d. marcu),
in: on the move to meaningful internet systems,
eds. r. meersman and z. tari,
lecture notes in computer science (3290),
copyright springer verlag, 2004.
"what's in a translation rule?",
(m. galley, m. hopkins, k. knight, d. marcu), proc. naacl-hlt, 2004.
get paper in pdf.
"training tree transducers",
(j. graehl, k. knight), proc. naacl-hlt, 2004.
get paper in pdf.
"the transonics spoken dialogue translator: an aid for english-persian doctor-patient interviews",
(s. narayanan, s. ananthakrishnan, r. belvin, e. ettaile, s. gandhe, s. ganjavi, p. g. georgiou, c. m. hein, s. kadambe, k. knight, d. marcu, h. e. neely, n. srinivasamurthy, d. traum, and d. wang),
aaai fall symposium, 2004.
2003
"syntax-based language models for machine translation"
(e. charniak, k. knight, and k. yamada), proc. mt summit ix, 2003.
get paper in pdf.
"syntax-based alignment of multiple translations: extracting
paraphrases and generating new sentences" (b. pang, k. knight,
and d. marcu), proc. naacl-hlt, 2003.
get paper in pdf.
"transonics: a speech to speech system for english-persian interactions" (s. narayanan, s. ananthakrishnan, r. belvin, e. ettaile, s. ganjavi, p. georgiou, c. hein, s. kadambe, k. knight, d. marcu, h. neely, n. srinivasamurthy, d. traum, and d. wang),
proc. ieee asru, 2003.
"finding the write stuff: automatic identification of discourse
structure in student essays" (j. burstein, d. marcu, and k. knight),
ieee intelligent systems, jan/feb, 2003.
"translation with scarce bilingual resources"
(y. al-onaizan, u. germann, u. hermjakob, k. knight, p. koehn,
d. marcu, k. yamada),
machine translation, 2003.
"fast decoding and optimal decoding for machine translation"
(u. germann, m. jahr, k. knight, d. marcu, and k. yamada),
artificial intelligence, 2003.
"teaching statistical machine translation" (k. knight),
proc. mt summit ix workshop on teaching machine translation, 2003.
get paper in pdf.
g. kondrak, d. marcu and k. knight. "cognates can improve statistical
translation models". human language technology conference of the north
american chapter of the association for computational linguistics
(hlt-naacl 2003), companion volume, pp. 46-48, edmonton, may 2003.
get paper in pdf.
"desperately seeking cebuano"
(douglas w. oard, david doermann, bonnie dorr, daqing he, philip resnik, amy weinberg,
william byrne, sanjeev khudanpur, david yarowsky,
anton leuski, philipp koehn, and kevin knight),
proc. naacl-hlt, 2003.
2002
"using a large monolingual corpus to improve translation accuracy"
(r. soricut, k. knight, and d. marcu),
proceedings of the 6th amta conference, 2002.
"learning a translation lexicon from monolingual corpora"
(p. koehn and k. knight),
proc. of acl workshop on unsupervised lexical acquisition, 2002.
"summarization beyond sentence extraction: a probabilistic
approach to sentence compression" (k. knight and d. marcu),
artificial intelligence, 139(1), 2002.
"named entity translation: extended abstract"
(y. al-onaizan and k. knight),
proc. hlt, 2002.
get paper in postscript.
"translating named entities using monolingual and bilingual resources"
(y. al-onaizan and k. knight),
proc. of the
conference of the association for computational linguistics (acl), 2002.
get paper in postscript.
"machine transliteration of names in arabic text"
(y. al-onaizan and k. knight),
proc. of acl workshop on computational approaches to
semitic languages, 2002.
get paper in postscript.
"a decoder for syntax-based statistical mt"
(k. yamada and k. knight),
proc. of the
conference of the association for computational linguistics (acl), 2002.
get paper in postscript.
2001
"a syntax-based statistical translation model"
(k. yamada and k. knight),
proc. of the
conference of the association for computational linguistics (acl), 2001.
get paper in postscript.
"fast decoding and optimal decoding for machine translation"
(u. germann, m. jahr, k. knight, d. marcu, and k. yamada),
proc. of the
conference of the association for computational linguistics (acl), 2001.
best paper award.
get paper in pdf.
"knowledge sources for word-level translation models"
(p. koehn and k. knight),
empirical methods in natural language processing conference
(emnlp), 2001.
get paper in postscript.
2000
"translating with scarce resources"
(y. al-onaizan, u. germann, u. hermjakob, k. knight, p. koehn,
d. marcu, k. yamada),
national conference on artificial intelligence (aaai), 2000.
get paper in postscript.
"preserving ambiguities in generation via automata intersection"
(k. knight and i. langkilde),
national conference on artificial intelligence (aaai), 2000.
get paper in postscript.
"estimating word translation probabilities from unrelated monolingual
corpora using the em algorithm"
(p. koehn and k. knight),
national conference on artificial intelligence (aaai), 2000.
get paper in postscript.
"statistics-based summarization --- step one: sentence compression,"
(k. knight and d. marcu),
national conference on artificial intelligence (aaai), 2000.
outstanding paper award.
get paper in postscript.
1999
"decoding complexity in word-replacement translation models,"
computational linguistics, squibs & discussion, 25(4), 1999.
get paper in postscript (long version).
"mining online text," communications of the acm, 42(11), november 1999.
"a statistical mt tutorial workbook," unpublished, august 1999.
get paper in word.
"a computational approach to deciphering unknown scripts,"
(k. knight and k. yamada),
proceedings of the acl workshop on unsupervised learning
in natural language processing, 1999.
get paper in postscript.
1998
"machine transliteration,"
(k. knight and j. graehl),
computational linguistics, 24(4), 1998.
get conference version in pdf.
"translation with finite-state devices," (k. knight and y. al-onaizan),
proceedings of the 4th amta conference, 1998.
get paper in postscript.
"generation that exploits corpus-based statistical knowledge,"
(i. langkilde and k. knight), proc. of the
conference of the association for computational linguistics (coling/acl), 1998.
get paper in postscript.
"translating names and technical terms in arabic text,"
(b. stalls and k. knight), proc of the
coling/acl workshop on computational approaches to semitic languages,
1998.
get paper in postscript.
"the practical value of n-grams in generation,"
(i. langkilde and k. knight), proc. of the
international natural language generation workshop, 1998.
get paper
in postscript.
1997
"automating knowledge acquisition for machine translation,"
ai magazine 18(4), 1997.
get paper in postscript.
"machine transliteration,"
(k. knight and j. graehl), proc. of the
conference of the association for computational linguistics (acl), 1997.
get paper in pdf.
1996
"it's time for your evaluation," special section,
what makes a compelling empirical evaluation in ai, ieee expert, 11(5), 1996.
"learning word meanings by instruction," proc. of the
national conference on artificial intelligence (aaai), 1996.
get paper in postscript.
1995
"two-level, many-paths generation,"
(k. knight and v. hatzivassiloglou), proc. of the
conference of the association for computational linguistics (acl), 1995.
get paper in postscript.
"filling knowledge gaps in a broad-coverage mt system,"
(k. knight, i. chander, m. haines, v. hatzivassiloglou,
e. hovy, m. iida, s. k. luk, r. whitney,
and k. yamada), proc. of the
international joint conference on artificial intelligence (ijcai), 1995.
get paper in postscript.
"unification-based glossing,"
(v. hatzivassiloglou and k. knight), proc. of the
international joint conference on artificial intelligence (ijcai), 1995.
get paper in postscript.
1994/earlier
"integrating knowledge bases and statistics in mt,"
(k. knight, i. chander, m. haines, v. hatzivassiloglou, e. hovy,
m. iida, s. luk, a. okumura, r. whitney, k. yamada),
proc. of the conference of the association for machine translation
in the americas (amta), 1994.
get paper in postscript.
"building a large-scale knowledge base for machine translation,"
(k. knight and s. luk),
proc. of the national conference on artificial intelligence (aaai), 1994.
get paper in postscript.
"automated postediting of documents,"
(k. knight and i. chander),
proc. of the national conference on artificial intelligence (aaai), 1994.
get paper in postscript.
"are many reactive agents better than a few deliberative ones?"
proc. of the
international joint conference on artificial intelligence (ijcai), 1993.
"unification," the encyclopedia of
artificial intelligence, second edition, john wiley and sons, 1992.
"integrating knowledge acquisition and language acquisition,"
applied intelligence, 1(1), 1992.
artificial intelligence, second edition (e. rich and k. knight),
mcgraw-hill book company, 1991.
"connectionist ideas and algorithms," communications of the acm,
33(11), november, 1990.
"knowledge and natural language processing" (j. barnett, k. knight,
i. mani, and e. rich), communications of the acm, 33(8), august, 1990.
"unification: a multidisciplinary survey," acm computing
surveys, 21(1), 1989.
get paper in pdf.
Acceuil
suivante
kevin knight's home page Biographie Kevin Costner, Bio Kevin Costner - People Club France Mobiles : Portail de téléphonie mobile - Kevin (Logos Couleurs) Le profil de Kévin Gontier (FACULTE DE DROIT DE TOURS) sur ... Télépoche.fr : Britney et Kevin privés de tatouages Kevin Bacon Gluttonous Fréquenter L'oubli lyrics - Kevin Moroney lyrics Doug Wilson - Kevin Nealon Weeds - La serie de Showtime Blog Nascar: Une saison avec Kevin Harvick épisode #6. Boogity ... Kevin Parent nouveau spectacle Énergie Kevin & Howlin - Dublin - Critiques de Kevin & Howlin - TripAdvisor Kevin Spacey - Ciné Annales METROTECHNO : KEVIN SAUNDERSON - - ANTWERPEN-NOXX - ANTWERPEN ... Kevin Zegers Sports - Orange - NBA - Kevin Durant (Meneur) - Seattle Discographie de Kevin Pearce - James Prior- Fnacmusic ... Kevin Misher - uniFrance Bagatelles et légèreté: Kevin Kévin et les examens apres sa naissance - le blog kevinou La Taverne - Kevin et son pc Chupastars - Kevin - Equipe Skwat.com User » kevin Kevin DENIS : BONS DEBUTS EN PROMOSPORT 600 :: Cineguns :: Galerie de Kevin Spacey Kevin Everett a subi une blessure «catastrophique» Football ... Kevin Dean Kevin et Jade, vainqueurs de Koh Lanta 2007 - My Little Buzz La soul de Kevin Michael Le profil de Kevin Renaud (EFREI) sur Reseaucampus Arts&Crafts: Broken Social Scene Presents Kevin Drew Spirit If... LesPipoles - Kevin Costner actualité news photos videos en streaming www.kevinkmp.com - blog KevinDonahue.com Mr. Brooks - Avec Kevin Costner, Demi Moore : Première affiche Kevin Photo : planche-urbanr-kevin.jpg - Couleur / color - kahouetcity ... Arnaud TOULON: TADAAAAAM ! c'était COYOTE !!! (little kevin, les ... French Football League : Kévin ANIN B. Kevin Turner: Chief Operating Officer Kevin Poulsen Parent Kevin Guitar Tabs for - Mother Of Our Child Tabs, Chords ... Kevin Hightouch Full of IT Real Lawyers Have Blogs : Law Blogs, Blawgs, Law Firm Marketing Strange Attractor: Picking out patterns in the chaos Kevin Zegers Forum La philosophie autrichienne de Bolzano à Musil: histoire et actualité - Résultats Google Recherche de Livres Anonymi, magistri artium (c. 1246-1247) Sententia super II et III ... - Résultats Google Recherche de Livres Kevin Brown Kevin with a "K": Kevin with a "K" - Résultats Google Recherche de Livres Kevin Garnett - Résultats Google Recherche de Livres Kevin Jones: Snowboarding Superstar - Résultats Google Recherche de Livres Biblical and Near Eastern Essays: Studies in Honour of Kevin J. ... - Résultats Google Recherche de Livres Kevin Jones: Snowboarding Champion - Résultats Google Recherche de Livres City Sense and City Design: Writings and Projects of Kevin Lynch - Résultats Google Recherche de Livres Kevin Garnett/Shake Up The Gam - Résultats Google Recherche de Livres Kevin Garnett: "Da Kid" - Résultats Google Recherche de Livres le site de kévin trantkat hérault Les meilleurs liens kevin sur EntrevueWeb - Découvrez les liens kevin Kévin Blog Gay amateur exhib TBM (uro)