Beefy Boxes and Bandwidth Generously Provided by pair Networks
Pathologically Eclectic Rubbish Lister
 
PerlMonks  

Re^4: heuristic to detect (perl) code

by LanX (Chancellor)
on Jan 19, 2013 at 09:53 UTC ( #1014193=note: print w/replies, xml ) Need Help??


in reply to Re^3: heuristic to detect (perl) code
in thread heuristic to detect (perl) code

I just need a fuzzy heuristic "looks like code" with more than 80% reliability.

(EDIT: Please keep in mind that posted code is not necessarily valid code)

> OTOH :) Re^2: CSS Show and Tell: Colored Code

Thanks looking into code-mirror might help.

OTOH I tried Free Nodelet hack: Syntax colouring some time ago and had to deactivate it because of the caused overhead.

So the most highest detection rate per necessary code lines is my goal.

Cheers Rolf

UPDATE:

> PPI is fairly straightforward, s/// is easily converted to .replace, the regex are the simple variety, it is possible

I suppose this was already done within perlito

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1014193]
help
Chatterbox?
[Discipulus]: good morning Corion!
[Corion]: I spent parts of yesterday digging into Filter::Simple to make it not change the line numbers of unfiltered code, but I think I need regex help (which will come in a SoPW I guess). I tried @lines = /((?:"...."|[^" \n]+)\n)/gs, but that matched on...
[Corion]: ... foo "\0\0\0\n" bar\n by matching the first \n instead of matching the four-bytes-in- double-quotes part
[Corion]: ... and I still don't understand why ;)
[Corion]: This is because Filter::Simple does some string-trickery, replacing all string literals with "quoted packed numbers"
[Corion]: I fear this might be a bug in the RE engine, but if it is a bug, even fixing won't help me because I need Filter::Simple for Filter::signatures , which provides signatures as a backwards compatibility feature for Perl <5.22 - and these ...
[Corion]: ... won't get a fix anyway ;) My plan B is to encode the string placeholders avoiding \r and \n

How do I use this? | Other CB clients
Other Users?
Others wandering the Monastery: (10)
As of 2017-01-23 08:04 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    Do you watch meteor showers?




    Results (191 votes). Check out past polls.