Beefy Boxes and Bandwidth Generously Provided by pair Networks
Come for the quick hacks, stay for the epiphanies.
 
PerlMonks  

Re^3: Repair malformed XML

by rg0now (Chaplain)
on Feb 03, 2005 at 17:38 UTC ( [id://427730]=note: print w/replies, xml ) Need Help??


in reply to Re^2: Repair malformed XML
in thread Repair malformed XML

I am a little lost here. You told us that all the problems you have with your XML is that it has some unclosed tags. XML::LibXML::Parser's recover flag will handle it, as the manual tells:

"The recover mode helps to recover documents that are almost wellformed very efficiently. That is for example a document that forgets to close the document tag (or any other tag inside the document)."

Now, you seem to indicate that some tags in your XML are corrupt. Well, I do not really know, how to handle that one...

Also, I do not think that you hit some obscure size limitations of XML::LibXML (you seem to get the error at the 85th input line).

Replies are listed 'Best First'.
Re^4: Repair malformed XML
by spoulson (Beadle) on Feb 03, 2005 at 17:55 UTC
    Sorry if I was unclear. The XML data is not corrupt. It appears that LibXML cannot load an 80MB XML without corrupting its own data. When I search within the XML, I do not find the offending parser error on line 85, or anywhere in the file.

    So, I believe it to be a size limitation that causes internal memory management issues. Why 85th line? Maybe a pointer wrapped and happened to clobber the 85th line. Who knows. :/

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://427730]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others having a coffee break in the Monastery: (6)
As of 2024-04-18 01:04 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found