|Syntactic Confectionery Delight|
Re: Going from PDF to GEDCOMby Anonymous Monk
|on Nov 08, 2010 at 16:23 UTC||Need Help??|
run strings, count the number of occurences
and you 'll get the most common words ( burial/in/on/he/she/they/died/born/married )
To get sentences, slurp a page, split on period not followed by a comma (or other punctuation).
Then split into parts based on the common words and do something with them.
But, I've no idea how a sentence (or a bunch) translate into gedcom calls.
How did you generate the sentences in the first place? Reverse that process