
Re: multi line regex

by matija (Priest)
on Jan 09, 2006 at 14:54 UTC (#521954)

in reply to multi line regex

This is wrong in so many ways. First of all, you're parsing HTML with a regex. Don't do that. Use HTML::Parser instead.

Otherwise, there are just too many ways to get tripped up: tags with extra whitespace, tags with newlines, quotes missing or present in unexpected places, escaping of one thing or another, JavaScript code fooling you into thinking you're in another tag when you really aren't, etc.
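
To make that concrete, here is a minimal sketch of pulling cell text out with HTML::Parser (version 3 API) rather than a regex. The sample HTML, the variable names and the choice to collect only td text are assumptions for illustration, not taken from the original question. The opening tag is deliberately split across lines and the second tag is uppercase, which are two of the cases listed above.

```perl
use strict;
use warnings;
use HTML::Parser;

# Hypothetical input: a tag broken across lines, stray whitespace,
# and mixed-case tag names.
my $html = <<'HTML';
<table>
<tr><td
   class="word" >house</td><TD>Haus</TD></tr>
</table>
HTML

my @cells;        # text of each <td>, in document order
my $in_cell = 0;  # true while we are between <td> and </td>

my $parser = HTML::Parser->new(
    api_version => 3,
    # 'tagname' is passed in lower case by default
    start_h => [ sub {
        my ($tag) = @_;
        if ($tag eq 'td') { $in_cell = 1; push @cells, '' }
    }, 'tagname' ],
    end_h => [ sub {
        my ($tag) = @_;
        $in_cell = 0 if $tag eq 'td';
    }, 'tagname' ],
    # 'dtext' is the text with HTML entities decoded
    text_h => [ sub {
        my ($text) = @_;
        $cells[-1] .= $text if $in_cell;
    }, 'dtext' ],
);

$parser->parse($html);
$parser->eof;     # flush anything still buffered

print join(' | ', @cells), "\n";
```

The parser does the tokenizing, so the handlers only ever see complete tags and decoded text. This sketch does not handle nested tables.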

Second, you're trying to extract data from an HTML table using regex. Don't do that. Use HTML::TableExtract instead. It will save you a LOT of hairpulling.

Replies are listed 'Best First'.
Re^2: multi line regex
by metalfan (Novice) on Jan 18, 2006 at 17:51 UTC
    Looks good. Sorry for this question, but how can I use this to
    get the word in the first column?

    1.column | 2.column
    english word | german word

    thx for help
      Read the manual pages for HTML::TableExtract - once it parses the table, the first column will be the first element of the row array.
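
As a rough sketch of what that looks like in practice, assuming a two-column table of word pairs like the one described (the sample HTML and the variable names here are made up for illustration):

```perl
use strict;
use warnings;
use HTML::TableExtract;

# Hypothetical input: English word in column 1, German word in column 2.
my $html = <<'HTML';
<table>
  <tr><td>house</td><td>Haus</td></tr>
  <tr><td>dog</td><td>Hund</td></tr>
</table>
HTML

# No constraints given, so every table in the document matches.
my $te = HTML::TableExtract->new;
$te->parse($html);

my @english;
for my $table ($te->tables) {
    for my $row ($table->rows) {
        # Each row is an array reference; index 0 is the first column.
        push @english, $row->[0];
    }
}

print "$_\n" for @english;
```

If the real table has a header row, it will show up as the first row here. The module's headers option can be used to select columns by header text instead; see its documentation for how that changes which rows and columns are returned.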
