Beefy Boxes and Bandwidth Generously Provided by pair Networks
Welcome to the Monastery
 
PerlMonks  

Re: Is there a simple way to archive/download all of PerlMonks?

by nikosv (Deacon)
on Apr 28, 2024 at 09:10 UTC ( [id://11159129]=note: print w/replies, xml ) Need Help??


in reply to Is there a simple way to archive/download all of PerlMonks?

Actually that could be useful in training/fine tuning a local LLM
on the collective Perlmonks threads/data so you can ask free style ChatGPT alike questions on it.
  • Comment on Re: Is there a simple way to archive/download all of PerlMonks?

Replies are listed 'Best First'.
Re^2: Is there a simple way to archive/download all of PerlMonks?
by marto (Cardinal) on Apr 28, 2024 at 09:33 UTC

    This will probably be illegal in future, cruelty to AI.

      I think otherwise: compared to SO, it would be a merciful treatment;-)

        I was thinking along the lines of science fiction stories like Neuromancer, where "AI" was granted some rights and legal protections. Regardless, without sufficient safeguards I don't think this would be a good idea with this sites content. It may be better that than SO, but that's nor really the point.

Re^2: Is there a simple way to archive/download all of PerlMonks?
by LanX (Saint) on Apr 28, 2024 at 11:47 UTC
    I'm very far from being a "prompt engineer", but I suppose it should be already possible to tell AI-search only to consider perlmonks.

    The harder part is ignore everything from certain "BS monks" ;)

    Cheers Rolf
    (addicted to the Perl Programming Language :)
    see Wikisyntax for the Monastery

      This is only if the perlmonks text was swept up in the training data. It's probably in our best interest to make perlmonks more downloadable so that this body of information is available to LLM tools. People might actually decide to use or not use perl for a task based on how well ChatGPT can answer questions about it. Lately I've been asking it a bunch of questions about Vue3 and amazes how useful the answers are (as a search engine, it still doesn't write accurate code).
        I've found an genious way to check if content from perlmonks was used.

        I asked! ;)

        While PerlMonks.org is a publicly available website, I want to clarify that I have not been specifically trained on content from PerlMonks.org. My training data includes a variety of publicly available texts and sources related to Perl programming, but the specific content from PerlMonks.org has not been used in my training. If you have any questions or need assistance with Perl code, feel free to ask, and I'll do my best to help!

        That was "chatgpt 3.5 turbo" as provided by duckduckgo.

        Cheers Rolf
        (addicted to the Perl Programming Language :)
        see Wikisyntax for the Monastery

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://11159129]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others exploiting the Monastery: (4)
As of 2025-06-15 17:51 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found

    Notices?
    erzuuliAnonymous Monks are no longer allowed to use Super Search, due to an excessive use of this resource by robots.