Menu
Menu
DaniWeb
Log In
Sign Up
Read
Contribute
Meet
Search
Search
About 1,000 results for
wiki
- Page 1
Evaluating OpenAI GPT 4.1 for Text Summarization and Classification Tasks
Programming
Computer Science
4 Days Ago
by usmanmalik57
… performance of AI models. [ROUGE metric](https://en.wikipedia.org/
wiki
/ROUGE_(metric)) is one such criterion. The following script defines…
Re: Cannot run exe from asp.net
Programming
Web Development
6 Days Ago
by Salem
… somewhere. You want something like this: https://en.wikipedia.org/
wiki
/Remote_procedure_call There are multiple ways of doing this. Can you…
Accessibility vs design
Digital Media
UI / UX Design
2 Weeks Ago
by Dani
… in UX design? Specifically, [the WCAG](https://en.wikipedia.org/
wiki
/Web_Content_Accessibility_Guidelines)? Personally, I have tried and not been very successful…
Re: Where Can I Find Test Data (Names, Emails, etc.) for My Testing?
Programming
Web Development
3 Weeks Ago
by gediminas.bukauskas.7
Look at Customers table in NorthWind database (https://en.wikiversity.org/
wiki
/Database_Examples/Northwind).
Re: Buggy career talk :-P
Programming
2 Weeks Ago
by Salem
This https://en.wikipedia.org/
wiki
/Peterson%27s_algorithm Plus two different kinds of processors. Plus an …
Re: Accessibility vs design
Digital Media
UI / UX Design
2 Weeks Ago
by rproffitt
In the news: > [Trump Administration Withdraws ADA Guidance ](https://www.disabilityscoop.com/2025/03/20/trump-administration-withdraws-ada-guidance/31368/) Looks like it's no longer a problem.
Benchmarking DeepSeek R1 for Text Classification and Summarization
Programming
Computer Science
2 Months Ago
by usmanmalik57
… ``` We will use the [ROUGE score](https://en.wikipedia.org/
wiki
/ROUGE_(metric)) metric to evaluate the performance of our DeepSeek…
How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by rproffitt
… have the Dissociated Press Examples from https://en.wikipedia.org/
wiki
/Dissociated_press where a script makes interesting replies. AI as it…
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Reverend Jim
… read the wikipedia entry on [Enshitification](https://en.wikipedia.org/
wiki
/Enshittification).
Text Classification and Summarization with DeepSeek R1 Distill Llama 70B
Programming
Computer Science
2 Months Ago
by usmanmalik57
… will use the average [ROUGE scores](https://en.wikipedia.org/
wiki
/ROUGE_(metric)) for all model-generated summaries. The following function…
DeepSeek R1 vs Llama 3.1-405b for Text Classification and Summarization
Programming
Computer Science
1 Month Ago
by usmanmalik57
… summaries and returns the [ROUGE scores](https://en.wikipedia.org/
wiki
/ROUGE_(metric)), a commonly used evaluation criteria for text summarization…
Re: how to get back visual basic 6 project again on coding again
Programming
2 Months Ago
by Salem
… look on this as an opportunity. https://en.wikipedia.org/
wiki
/Visual_Basic_(classic) "On April 8, 2008, Microsoft stopped supporting…
Re: Mention The Popular Blockchain Platforms
Programming
Software Development
2 Months Ago
by Dani
Bitcoin and Ethereum are the biggies. [Here's a comprehensive list.](https://en.wikipedia.org/
wiki
/List_of_blockchains)
Re: ‘Advanced AI should be treated similar to Weapons of Mass Destruction’
Community Center
1 Month Ago
by rproffitt
UPDATE: Feb 4, 2025 — Google on Tuesday updated its ethical guidelines around artificial intelligence, removing commitments not to apply the technology to weapons or surveillance.
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Dani
I don't understand what goal you are trying to achieve? Is your goal to open a dialog about the pros and cons of AI? DaniWeb is powered by Cloudflare. One of the functions of Cloudflare is a sophisticated system to analyze and control how AI crawlers scan the website. In other words, if I want to dissuade AI bots from crawling DaniWeb, I …
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by rproffitt
For example, with Meta and others removing fact checking we should find a way to render their AI and search results full of not so useful information. We are right now veering towards a Fascist state with oligarchs and mega corporations stoking coal into the ovens. We shouldn't be fuel for those ovens.
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Dani
I’m not nearly as much of a conspiracy theorist. I also don’t think that spamming Facebook with nonsensical posts is going to make the world a better place.
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Pebble94464
Don't waste your time, rproffitt. Spamming the web is unlikely to achieve your goals... Firstly, everything you post online is but a wee drop in the ocean. You'd need to do an illegal amount of spamming in order to sway an opinion. Secondly, AI bots crawling the web can be instructed to simply ignore pages that contain censored keywords. AI …
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by rproffitt
I asked around and it appears we can affect change. The immigrant reporting hotline was flooded with reports about Elon Musk so that line shut down. As to AI crawlers the work to poison the AIs is well underway. Examples follow. > Here is a curated list of strategies, offensive methods, and tactics for (algorithmic) sabotage, disruption, …
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Dani
If you're not a part of the solution, you're a part of the precipitate. I think this sounds terrible. The global population is, more and more, relying on AI to serve up accurate answers. There's already the gigantic problem of hallucinations as well as AI consistently spewing out false information that sounds entirely believable, and therefore …
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Dani
> When you price and design a site for an expected human load, and then you get overwhelmed by bots, you can throw more money at it or you can take action against the bots. It's true that the majority of websites on the Internet today spend more bandwidth on bots than they do on human visitors. However, there are both bad bots and good bots, …
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by rproffitt
The OpenAI bot appears to be a bad bot. Discussed many times so here's just one: https://www.reddit.com/r/selfhosted/comments/1i154h7/openai_not_respecting_robotstxt_and_being_sneaky/ Fixes appear to be: 1. Block IP ranges from bots. 2. Replace words and poison the bots.
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Reverend Jim
Thanks for the extra info although I disagree with the spewing comment. Nepenthes and Iocaine do not spew garbage across the web. They feed garbage to bots that access the protected sites. AI that returns bogus results on the ppther hand ARE spewing garbage across the web. BTW Nepenthes makes it clear that implementation will result in being …
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Dani
> The OpenAI bot appears to be a bad bot. This is not my experience. OpenAI respects my robots.txt file perfectly. I do want to add, though, that robots.txt files are very finicky, and I have seen many, many times people blaming the bots when the problem lies with a syntax or logic error in their robots.txt. > Nepenthes and Iocaine do…
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Dani
> The OpenAI bot appears to be a bad bot. Specifically, I would bet quite a large sum of money that the people who are complaining they can't get OpenAI to respect their robots.txt file either have a syntax error in their file, and/or aren't naming the correct user agents. I've seen people mistakingly try to reference a user agent called &…
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Dani
> The creator of Nepenthes says that it is ineffective against OpenAI which I take to mean that OpenAI is ignoring robots.txt. As mentioned, Nepenthes uses the spoofing technique. Spoofing does not rely whatsoever on bots following robots.txt.
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Salem
> But it's also in everyone's interest for AI to be trained on reliable information, if we want AI to be useful to us Yeah, that ship slipped it's mooring when facebook appeared, drifted out to sea on the twitter tide, and promptly sank when muck took it over. Domain specific AI's trained on the likes of https://arxiv.org/ might be worth …
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Reverend Jim
>OpenAI can detect the content thrown at it is nonsensical So OpenAI doesn't crawl Facebook and Twitter? How about Fox News and related sites? And if it ignores Fox, etc, are we thern going to get Trump screaming about radical liberal bias? How does AI distinguish between conspiracy theory and reality?
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Reverend Jim
Remember what happened with Microsoft's chatbot, TAY? It was shut down after only 16 hours when trolls trained it to spout racist slurs and profanity. OpenAI and similar systems are trained on the cesspool that is the entire internet. Sturgeon's Law says 90% of everything is crap. That may well apply to the internet. I'm surprised it hasn't …
Re: How would we poison AI web crawls?
Hardware and Software
Information Security
2 Months Ago
by Dani
> Many places ban or remove AI generated content. We are one of them! :)
1
2
3
17
Next
Last
Search
Search
Forums
Forum Index
Hardware/Software
Recommended Topics
Programming
Recommended Topics
Digital Media
Recommended Topics
Community Center
Recommended Topics
Latest Content
Newest Topics
Latest Topics
Latest Posts
Latest Comments
Top Tags
Topics Feed
Social
Top Members
Meet People
Community Functions
DaniWeb Premium
Newsletter Archive
Markdown Syntax
Community Rules
Developer APIs
Connect API
Forum API Docs
Tools
SEO Backlink Checker
Legal
Terms of Service
Privacy Policy
FAQ
About Us
Advertise
Contact Us
© 2025 DaniWeb® LLC