Beefy Boxes and Bandwidth Generously Provided by pair Networks
Think about Loose Coupling
 
PerlMonks  

Wikipedia content to text converter

by vit (Friar)
on Jul 30, 2012 at 22:26 UTC ( [id://984535]=perlquestion: print w/replies, xml ) Need Help??

vit has asked for the wisdom of the Perl Monks concerning the following question:

Dear Monks,
I dumped wikipedia as an XML file and want to extract only a valuable content to plain text.
I do not want to rewrite this code since this looks to be a task which many people might be interested in and this code should exist.
So if somebody could share, this will be very much appreciated.

Replies are listed 'Best First'.
Re: Wikipedia content to text converter
by Anonymous Monk on Jul 30, 2012 at 22:28 UTC

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: perlquestion [id://984535]
Approved by ww
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others cooling their heels in the Monastery: (2)
As of 2025-06-22 00:11 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found

    Notices?
    erzuuliAnonymous Monks are no longer allowed to use Super Search, due to an excessive use of this resource by robots.