Code for recent papers
The code for my recent papers is available at the Utah NLP GitHub repository. Please follow the GitHub projects and let us know if things don’t work.
Software and Demos
Here is a list of software and demos that I have written, contributed to, released and/or possibly maintain. (Since most of the people involved in these have moved, not all the links may be current.)
NLP Software from Illinois
Edison: Edison is a Java library for representing different NLP annotations (views) over text in the form of graphs over constituents. It provides easy-to-use accessors for different types of views and facilitates feature extraction. Here is a presentation I gave which introduces the API.
Cogcomp core utilities: This is a collection of utility classes that I found myself writing again and again for NLP and machine learning work. The Java standard library should really have this functionality (the Pair class, for example). This is not strictly an NLP library, but a collection of Java classes that are useful for NLP applications.
Curator: The curator manages several NLP components and provides uniform access to them. It also caches annotations. The Curator and Edison are closely linked to each other and are described in [this paper][curatorPaper], which was published in LREC 2012.
Semantic Role Labeling: The Java version of the Illinois Semantic Role Labeler. The SRL system is based on the work of [Punyakanok, Roth and Yih, 2008][punyakanokSRLCitation] and is a complete rewrite of their system, with additional bells and whistles. You can get it by downloading the Curator.
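To illustrate the kind of small utility mentioned under Cogcomp core utilities, here is a minimal sketch of a generic, immutable pair in Java. This is not the actual class from that library; the names `Pair`, `getFirst`, `getSecond` and `PairDemo` are assumptions made for this example only.

```java
import java.util.Objects;

// Hypothetical sketch of an immutable pair; not the Cogcomp implementation.
final class Pair<A, B> {
    private final A first;
    private final B second;

    Pair(A first, B second) {
        this.first = first;
        this.second = second;
    }

    A getFirst() { return first; }

    B getSecond() { return second; }

    // Two pairs are equal when both components are equal (null-safe).
    @Override
    public boolean equals(Object other) {
        if (this == other) return true;
        if (!(other instanceof Pair)) return false;
        Pair<?, ?> that = (Pair<?, ?>) other;
        return Objects.equals(first, that.first) && Objects.equals(second, that.second);
    }

    // Consistent with equals, so pairs can be used as map keys.
    @Override
    public int hashCode() { return Objects.hash(first, second); }

    @Override
    public String toString() { return "(" + first + ", " + second + ")"; }
}

class PairDemo {
    public static void main(String[] args) {
        // Example: a part-of-speech tag together with its count.
        Pair<String, Integer> tagCount = new Pair<>("verb", 3);
        System.out.println(tagCount);
    }
}
```

Defining `equals` and `hashCode` together is what makes such a class usable as a key in hash-based collections, which is a common need in feature extraction code.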
Online Demos at Illinois
Semantic Role Labeling: A demo of the SRL system. This also includes the preposition relations that are described in my thesis.
[Dataless Classification][datalessClassification]: A demo of our [AAAI 2008 paper][datalessPaper]. It takes text and two arbitrary textual labels and decides which label better describes the text. The demo uses our home-grown implementation of Explicit Semantic Analysis.
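The input and output of the dataless classification demo can be sketched in a few lines of Java. This is only an illustration of the interface: the demo compares text and labels using Explicit Semantic Analysis, whereas this sketch swaps in plain bag-of-words cosine similarity, so the labels here must share words with the text. The class `DatalessSketch` and all of its method names are hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: bag-of-words cosine similarity stands in for ESA.
class DatalessSketch {
    // Lowercased word counts for a piece of text.
    static Map<String, Integer> bagOfWords(String text) {
        Map<String, Integer> counts = new HashMap<>();
        for (String token : text.toLowerCase().split("[^a-z]+")) {
            if (!token.isEmpty()) counts.merge(token, 1, Integer::sum);
        }
        return counts;
    }

    // Euclidean norm of a count vector.
    static double norm(Map<String, Integer> v) {
        double sum = 0.0;
        for (int c : v.values()) sum += (double) c * c;
        return Math.sqrt(sum);
    }

    // Cosine similarity between two count vectors; 0 if either is empty.
    static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0.0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            dot += e.getValue() * b.getOrDefault(e.getKey(), 0);
        }
        double normA = norm(a);
        double normB = norm(b);
        return (normA == 0 || normB == 0) ? 0.0 : dot / (normA * normB);
    }

    // Returns whichever label is more similar to the text; ties go to labelA.
    static String classify(String text, String labelA, String labelB) {
        Map<String, Integer> t = bagOfWords(text);
        double a = cosine(t, bagOfWords(labelA));
        double b = cosine(t, bagOfWords(labelB));
        return a >= b ? labelA : labelB;
    }

    public static void main(String[] args) {
        String text = "The pitcher threw the ball and the team won the game";
        System.out.println(classify(text, "baseball game team", "election vote government"));
    }
}
```

A representation such as ESA removes the need for word overlap, since text and labels are compared in a shared concept space rather than by the words they have in common.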
Machine learning code
Code that I use, have helped develop, or both.
Wolfe is a Scala library for quickly building learning-based applications. It is primarily driven by Sebastian Riedel, Sameer Singh, Tim Rocktäschel and co. Also check out Moro, which lets you build a Scala notebook in the browser, like IPython.
Illinois SL: A package for structured learning in Java.
[punyakanokSRLCitation]: http://cogcomp.cs.illinois.edu/papers/PunyakanokRoYi07.pdf
[curatorPaper]: ../research/2012-5-lrec.html
[datalessClassification]: http://cogcomp.cs.illinois.edu/page/demo_view/Dataless
[datalessPaper]: ../research/2008-aaai2.html