Is there an algorithm that can be used to extract simple sentences from paragraphs?
My ultimate goal is to later run another algorithm on the resulting simple sentences to determine the author's sentiment.
I've researched this from sources such as Chae-Deug Park but none discuss preparing simple sentences as training data.
Thanks in advance
Take a look at Apache OpenNLP; it has a Sentence Detector module. The documentation has examples of how to use it from the command line and from the API.
I have just used OpenNLP for the same task.
Example
It failed only when the input contained a human mistake. E.g. the abbreviation "Dr." must have a capital D, and at least one space is expected between two sentences.
You can also achieve this with a regular expression, in the following way:
Example
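The original snippet is not available here, so the following is only a minimal sketch of the regex idea, not the exact code from this answer. It assumes a sentence ends with `.`, `!` or `?` followed by whitespace and an uppercase ASCII letter; the class name `RegexSentenceSplitter` and the pattern itself are my own choices for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

class RegexSentenceSplitter {
    // Assumed boundary rule: whitespace that follows '.', '!' or '?'
    // and precedes an uppercase ASCII letter.
    private static final Pattern BOUNDARY =
            Pattern.compile("(?<=[.!?])\\s+(?=[A-Z])");

    // Returns the sentences of the paragraph in order, without empty entries.
    static List<String> split(String paragraph) {
        List<String> sentences = new ArrayList<>();
        for (String part : BOUNDARY.split(paragraph.trim())) {
            if (!part.isEmpty()) {
                sentences.add(part);
            }
        }
        return sentences;
    }
}
```

Calling `RegexSentenceSplitter.split("This is one. This is two! Is this three?")` gives three sentences. Note that a rule this simple also splits after abbreviations: "Dr. Smith arrived. He sat down." comes back as three pieces, with "Dr." on its own.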
But the error rate is comparatively high. Another way is to use BreakIterator:
Example:
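Again, the original code is missing, so this is just a sketch of how `java.text.BreakIterator` is typically used for sentence boundaries. The class name `BreakIteratorSentenceSplitter` and the choice of `Locale.US` are assumptions on my part; pick the locale that matches your text.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

class BreakIteratorSentenceSplitter {
    // Walks the sentence boundaries reported by the JDK and collects
    // the trimmed text between each pair of consecutive boundaries.
    static List<String> split(String paragraph) {
        BreakIterator it = BreakIterator.getSentenceInstance(Locale.US);
        it.setText(paragraph);

        List<String> sentences = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String sentence = paragraph.substring(start, end).trim();
            if (!sentence.isEmpty()) {
                sentences.add(sentence);
            }
        }
        return sentences;
    }
}
```

This needs no external library or model file, which makes it easy to try before pulling in OpenNLP. How it copes with abbreviations and missing spaces depends on the JDK's built-in rules, so test it on your own data.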
Benchmarking: