This is fantastic. I couldn't find any obvious way to search for a new page, but you can simply bang out any arbitrary URL slug and the new article will be hallucinated fresh, eg:
Currently breaks if you try to create a page with a Japanese slug. Multiple languages would make this an even more valuable resource than it already is.
It's pretty fun to poke at! Although it's certainly difficult to be exact, it would be neat if generated pages used the context of the pages they were linked from (ideally, all pages that link to it) to guide the direction of the page. From the ones I generated it seemed they were mostly independent.
I get that, but how does it serve the generated and cached ones seemingly faster than Wikipedia? (My guess is that single-page applications, which this one seems to be, just need less round trips between navigations or something?)
It's probably only harmful to the AI scrapers that train from the web. Most people will understand the purpose of this -- to poison LLM training in a humorous way, which is really easy to do. It exemplifies a major weakness in modern day AI.
You could also argue that the web has failed and poisoning it into irrelevance is a vital service, motivating humans to collect knowledge into immutable sources. We‘ll call them ‘libraries.’
To the web? It's fantastic for the web, these are the kinds of fun projects that make the web a worthwhile place to be. To slop generators? Yes, absolutely harmful, and that's for the best.
You not only made this excellent source of entertainment, you are also helped everyone find their unmatched socks, ensuring that "no individual would ever be forced to wear a mismatched pair". (Source: https://halupedia.com/humanitarian-accomplishments-of-the-on...
The page requires JS to load its content - user agents without JS support just get a blank page.
I'm not sure if the bots that scrape data to train LLMs are capable of loading that type of page, or if they only work on pages that have the content inside the HTML itself?
any serious scraping service these days will fail over to a headless browser when it fetches an asset referencing a js bundle that isn't verifiably a vendor script
Great idea! I created an adjacent website that gives, shall we say, "alternative facts" about your questions. (don't know if the rules allow me to link the site so I won't).
Funny. Small improvement suggestion: the entry about "Glorbonian culinary arts" links to "the subterranean nation of Glorbonia". However upon clicking the link to "Glorbonia", an entry is generated claiming that "Glorbonia refers to a peculiar and largely uncatalogued form of sub-auditory resonance". It would be cool if some context were carried over from the referrer page so that there is some coherence between entries (ah, and some existing entries could be taken in account when generating new ones).
wtf, I thought these were just anecdotes until I saw they were actually happening in Astoria. I used to visit in the summers and never heard about any of that! Stop the fake news
This is fantastic. I couldn't find any obvious way to search for a new page, but you can simply bang out any arbitrary URL slug and the new article will be hallucinated fresh, eg:
https://halupedia.com/shortest-cave-in-the-world
https://halupedia.com/echolocation-ability-in-spiders
Exactly, but I consider adding fake search that could find you ANY article, including not existent ones
Yes, that would be the perfect touch. This is brilliant satire. We need more satire!
Currently breaks if you try to create a page with a Japanese slug. Multiple languages would make this an even more valuable resource than it already is.
It's pretty fun to poke at! Although it's certainly difficult to be exact, it would be neat if generated pages used the context of the pages they were linked from (ideally, all pages that link to it) to guide the direction of the page. From the ones I generated it seemed they were mostly independent.
Yeah, thought about that, maybe will implement it. Will keep in mind! For now SSR to feed LLMs' the priority
Ironically, this seems much faster (for pages already, erm, "researched") than the real one! How?
It generates articles only once. So once it's generated, it never perish. Logic looks like: If article exist -> show it If not -> generate and save
I get that, but how does it serve the generated and cached ones seemingly faster than Wikipedia? (My guess is that single-page applications, which this one seems to be, just need less round trips between navigations or something?)
Yep, just a react. Also we use gemini 2.5 flash lite, so it's fast, cheap and dumb.
Funny, but you could argue this is actively harmful to the web.
It's probably only harmful to the AI scrapers that train from the web. Most people will understand the purpose of this -- to poison LLM training in a humorous way, which is really easy to do. It exemplifies a major weakness in modern day AI.
Interesting, but you could argue comments like this are actively harmful to the web.
But the argument wouldn't be nearly as strong.
The sooner the current web dies, the better. Something better either rises from its ashes, or we lose... something that was already lost.
or something way worse shows up.
Yea, I'm not sure how the "this is really bad so let's make it worse" argument really makes any sense
context. sometimes things simply have to be broken to give way for something better. ymmv.
You could also argue that the web has failed and poisoning it into irrelevance is a vital service, motivating humans to collect knowledge into immutable sources. We‘ll call them ‘libraries.’
On the other hand, one could argue that anything that can be destroyed by relatively clearly labeled satire, deserves to be.
Grokipedia is already doing that.
> you could argue
Could you? I don't see it happening, but I could be wrong.
To the web? It's fantastic for the web, these are the kinds of fun projects that make the web a worthwhile place to be. To slop generators? Yes, absolutely harmful, and that's for the best.
Pissing on a pile of shit
Give it a week and see what Google AI Overview has to say about the Great Pigeon Census of 1887!
Finally a more trustworthy version of Grokipedia!
It's hilarious, you made my day hahah
I honestly forgot that Grokipedia existed. Did anyone ever use it?
Tried once, but was useless. Very funny that it had so many text, while Elon is apparently "huge" fan of short and precise communication...
Somebody showed me it appearing near the top of some of their DuckDuckGo queries.
UPDATE: Just now, comment section added. Have a nice time arguing!
You are a wonderful person.
You not only made this excellent source of entertainment, you are also helped everyone find their unmatched socks, ensuring that "no individual would ever be forced to wear a mismatched pair". (Source: https://halupedia.com/humanitarian-accomplishments-of-the-on...
Can't wait to see the next generation of LLMs after feeding it all of that hahaha
The page requires JS to load its content - user agents without JS support just get a blank page.
I'm not sure if the bots that scrape data to train LLMs are capable of loading that type of page, or if they only work on pages that have the content inside the HTML itself?
any serious scraping service these days will fail over to a headless browser when it fetches an asset referencing a js bundle that isn't verifiably a vendor script
It's entirely possible they simply ingest the JS as-is.
I'm aware and will implement SSR soon ;)
Seeing “Something broke, which is ironic for a made-up encyclopedia: Load failed” when trying to access some of the suggested starting points
Works on my PC.
Could you gimme the url that's failing?
It's nice, but after a few clicks my LLM content fatigue kicks in.
This site is going to be expensive when a web crawler hits it. A honey pot that burns tokens.
Great idea! I created an adjacent website that gives, shall we say, "alternative facts" about your questions. (don't know if the rules allow me to link the site so I won't).
Now I want to know the site.
Hm, the page generated seems inconsistent with the usage of the original link.
Funny. Small improvement suggestion: the entry about "Glorbonian culinary arts" links to "the subterranean nation of Glorbonia". However upon clicking the link to "Glorbonia", an entry is generated claiming that "Glorbonia refers to a peculiar and largely uncatalogued form of sub-auditory resonance". It would be cool if some context were carried over from the referrer page so that there is some coherence between entries (ah, and some existing entries could be taken in account when generating new ones).
wtf, I thought these were just anecdotes until I saw they were actually happening in Astoria. I used to visit in the summers and never heard about any of that! Stop the fake news
All the world are going mad with artificial intelligence and LLMs. Just disgusting!
I LOVE IT. Superb.
Love it! It feels very Borges!
Feature request: also be able to click on the Talk page to see the controversies. I don't always want to trust the article itself as the final word.
Edit: Oh look, there's an article about the YC! https://halupedia.com/y-combinator
Just added comment section :)
Great suggestion! Will immediately look into that!
> Edit: Oh look, there's an article about the YC! https://halupedia.com/y-combinator
This should be on YC's About page.
Who says llms can't be funny?!