Summary

OpenNLP got off to a quick start in 2017 thanks to a 1.7.0 release on December 31, 2016. This version added support for Java 8 and set the tone for OpenNLP's 2017. In total, there were 7 releases in 2017. OpenNLP also got a new logo and website in 2017 with an updated look and easier navigation. OpenNLP also released its first model, a language detection model capable of identifying 103 languages. OpenNLP moved to GitHub for source management greatly simplifying the process of reviewing and merging pull requests.

Some features and improvements that were added to OpenNLP in 2017 include:

  • A new language model CLI tool.
  • Moses format support.
  • CONLL-U format support.
  • Language codes now are ISO 639-3 compliant.
  • Many more unit tests.
  • Prefix and suffix feature generators are now configurable.
  • Learnable lemmatizer now returns all possible lemmas for a given word and part-of-speech tag.
  • A new language detection component and trained language model.
  • Evaluation tests now support ISO-639-3 language codes.
  • Fixed handling of xml parsers used through out the package.
  • New experimental API for word vectors and support for GloVe vector files.
  • Added annotator notes to BratAnnotator.
  • Add 20Newsgroups format support to the doccat component.
  • Resolved concurrency issue in POS tagger.

Community Development

Apache OpenNLP has added 6 new committers and PMC members in 2017.

Talks and Presentations

Apache OpenNLP was presented at several events in 2017 and there will be more OpenNLP talks in 2018 across the world.

Releases

OpenNLP had 7 releases in 2017. They were:

Release Timeline

Models

The OpenNLP team was very excited to announce the language detection model's release on November 2, 2017. This model is capable of identifying 103 languages. The model is available for download from the OpenNLP website.

Activity

OpenNLP added 6 new committers and PMC members in 2017. There are currently 21 committers and 15 PMC members.

Tasks

  • 289 JIRA tasks were closed in 2017.
  • 346 JIRA tasks were opened in 2017.

Code

  • There were 269 closed pull requests.
  • There were 323 git commits throughout the year:

Notable Use of OpenNLP

OpenNLP powers an Air New Zealand Oscar chat bot.

“Air New Zealand uses OpenNLP to power its chatbot, Oscar. Launched in February 2017, Oscar provides a conversational interface for customers to ask questions about flights, amenities and policies. Using OpenNLP, we’ve been able to consistently provide over 50% conversational success and support hundreds of intents.”