good chemistry is complicated, and a little bit messy -LW |
|
PerlMonks |
Re: End of sentence regex excluding " i.e." and " e.g."by kennethk (Abbot) |
on Feb 06, 2017 at 18:00 UTC ( [id://1181188]=note: print w/replies, xml ) | Need Help?? |
First, to be pedantic, e.g. and i.e. should always be followed by a comma, so you are dealing with grammatical errors. http://www.dailywritingtips.com/comma-after-i-e-and-e-g/.
There isn't a general solution to this problem because of names (e.g, H.G. Wells) and quoting, but perhaps will be sufficiently robust for your need? In general, for a corpus like this, I'd split it into known good, known bad, and grey, and then use test-driven development in order to build out my filter. Update: Augmented regex for ! and ? #11929 First ask yourself `How would I do this without a computer?' Then have the computer do it the same way.
In Section
Seekers of Perl Wisdom
|
|