James Clarke, Vivek Srikumar, Mark Sammons and Dan Roth
LREC 2012.


Natural Language Processing continues to grow in popularity in a range of research and commercial applications, yet managing the wide array of potential NLP components remains a difficult problem. This paper describes Curator, an NLP management framework designed to address some common problems and inefficiencies associated with building NLP process pipelines; and Edison, an NLP data structure library in Java that provides streamlined interactions with Curator and offers a range of useful supporting functionality.


Bib Entry

  author = {James Clarke and Vivek Srikumar and Mark Sammons and Dan Roth},
  title = {{An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)}},
  booktitle = {Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)},
  year = {2012},
  month = {may},
  date = {23-25},
  address = {Istanbul, Turkey},