Beefy Boxes and Bandwidth Generously Provided by pair Networks
There's more than one way to do things
 
PerlMonks  

Re: Reverse engineering HTML pages to RDF/XML Schema Using ARC2 or Perl ?

by chrestomanci (Priest)
on Dec 07, 2010 at 09:09 UTC ( [id://875758]=note: print w/replies, xml ) Need Help??


in reply to Reverse engineering HTML pages to RDF/XML Schema Using ARC2 or Perl ?

I think you need to be more specific on what you are trying to do. Can you post a link to a page you are trying to parse, or paste in a short fragment, along with what you are trying to extract.

Having said that, if you are tying to parse HTML, then you probably want to use modules such as HTML::TreeBuilder (from CPAN). There have been two threads on this recently: how to quickly parse 50000 html documents?, Parsing HTML files

  • Comment on Re: Reverse engineering HTML pages to RDF/XML Schema Using ARC2 or Perl ?

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://875758]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others contemplating the Monastery: (6)
As of 2024-04-24 04:04 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found