Confidence level for CoreNLP Open IE Json format - stanford-nlp

I'm using the python wrapper for CoreNLP Open IE and outputting in json format, but I cannot see the confidence scores.
I tried adjusting the properties following this answer: Stanford CoreNLP OpenIE annotator but had no luck. Am I missing something, or has this feature still not be enabled?

Related

Is SyntaxNet Compatible with Open NLP?

I am new to OpenNLP and have used it to get some parsing sentences.
I saw in Google TensorFlow youtube videos that the Penn Treebank is old and sort of outdated.
They have made another parsing model named SyntaxNet available as Open Source.
My question is that is it possible to use SyntaxNet Models in Apache OpenNLP libraries.
SyntaxNet is a part of TensorFlow repo in git and is Python based.
I am a Java Developer.
Thanks in Advance.
OpenNLP would pick any tagging or syntactic/semantic model as long as you create the training data yourself. In this case, you would need to train POS tagger with their set of tags and also OpenNLP chunker and / or parser to implement SyntaxNet.
That said, my personal take on this is that SyntaxNet is a very opinionated piece and there is no reason to use it for instead of Penn Treebank. By doing so, you're locking yourself in a Google's solution.

Natural Language Process using SharpNLP with a sample

I am really new to C# and want to do a NLP project using SharpNLP. I know Currently it provides the following NLP tools:
sentence splitter
tokenizer
part-of-speech tagger
chunker
parser
name finder
coreference tool
interface to the WordNet lexical database
I tried several examples (i have those .nbin models) but failed to integrate the SharNLP tools to VS 2015. Can anyone give some guidance or samples to use this sharpNLP tool with VS.
Thanks
I have successfully created a sample project for newbies. You can get the project from the following link. PS:Please change the nbin file path as your's computer file path. Hope this will help.
Click HERE for Sample Project or as below.
https://drive.google.com/file/d/0B3XcMZLArSF1UURzODRiVmE0RUE/view?usp=sharing

Create User Guide documentation with Maven

I need to write and build easy maintainable, goodlooking, esay to change documentation in pdf and html 5 format. The source format must be easy to edit. This maven plugin has to support my company organziation theam(fonts, colors, pictures etc.), TOC generation, separation of chapters in different files, integration of images files, easy way to put code snipets in the documentation. I have Maven build and I was wondering what is the current best descision to do that?
I was investigate two options:
Doxia - using md(markdown) as input format. There is WSWG md free
editor, support a lot of the aforemention stuffs, etc. Need external repo for its artifacts.
Asciidoctor - use asdcii doc as input format. Support templating using fragments etc.
What are the advantages and disadvantaes of using this plugins?
Are there any other good solutions?
From my attempts to build the documentation first with Doxia and then with AsciiDoc I realised that Asciidoc is the better. It allows
Easy styling using yaml files. Default styling is also very good.
Asciidoc as a mark-up language is very good documented: Uder Guide AsciiDoc
Has good online editors and the language is more powerful than markdown for example, and easier to write in comparison to the xml format.
Good examples when using with Maven and ascii doc and easy to understand configurations.

Does Stanford Core NLP support lemmatization for German?

I found German parse and pos-tag models which are compatible with Stanford Core NLP. However I was not able to get German lemmatization working. Is there a way to do so?
Sorry, as far as I know no implementation of German lemmatization exists for Stanford CoreNLP.
Even the latest version 4.4.0 of CoreNLP still does not support lemmatization for German. See https://stanfordnlp.github.io/CoreNLP/human-languages.html for reference.
Since the version 3.6 is also German supported.
Check it under http://stanfordnlp.github.io/CoreNLP/history.html

How to parse word documents with ruby?

Does anyone know of a library that I can use on OS X/Linux to parse Word files and output the content as HTML?
I've had a look at win32ole but as far as I can see it's for Windows only, although I could be wrong.
Any suggestions?
The Word document format (ignoring docx for the moment) is terrible and was constantly changing. IMHO that is why there are so few (read: zero) Ruby libraries out there to parse them.
What I recommend doing is using JRuby and some of the established Java libraries for reading the doc format. Google should help you out there: http://schmidt.devlib.org/java/libraries-word.html.
There is a Java project for reading MIcrosoft file formats, POI (http://poi.apache.org/) and they do have Ruby bindings (http://poi.apache.org/poi-ruby.html) but I'm not sure how up-to-date those are. On their site it says the Ruby bindings are for 1.8.2...

Resources