Beefy Boxes and Bandwidth Generously Provided by pair Networks
Clear questions and runnable code
get the best and fastest answer
 
PerlMonks  

Wikipedia content to text converter

by vit (Friar)
on Jul 30, 2012 at 22:26 UTC ( #984535=perlquestion: print w/replies, xml ) Need Help??

vit has asked for the wisdom of the Perl Monks concerning the following question:

Dear Monks,
I dumped wikipedia as an XML file and want to extract only a valuable content to plain text.
I do not want to rewrite this code since this looks to be a task which many people might be interested in and this code should exist.
So if somebody could share, this will be very much appreciated.

Replies are listed 'Best First'.
Re: Wikipedia content to text converter
by Anonymous Monk on Jul 30, 2012 at 22:28 UTC

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: perlquestion [id://984535]
Approved by ww
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others exploiting the Monastery: (3)
As of 2023-12-01 00:18 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found

    Notices?