I'm not sure what the Venn diagram of knowledge needed to understand that sentence looks like, but it's probably more crowded in the intersection than one might think.
> 25K parameters is about 70 million times smaller than GPT-4. It will produce broken sentences. That's the point - the architecture works at this scale.
Since it seems to just produce broken and nonsensical sentences (at least based on the one example given) I'm not sure if it does work at this scale.
Anyway, as written this passage doesn't make a whole lot of sense (the point is that it produces broken sentences?), and given that it was almost certainly written by an AI, it demonstrates that the architecture doesn't work especially well at any scale (I kid, I kid).
Ok now we need 1541 flash attention.
How does it compare to a Markov chain generator, I wonder.
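For reference, the Markov chain baseline being alluded to is tiny: a word-level bigram generator just records which words follow which, then does a random walk over that table. A minimal sketch (the corpus and function names here are made up for illustration):

```python
import random

def train_bigrams(text):
    """Build a bigram table: each word maps to the list of words seen after it."""
    words = text.split()
    table = {}
    for a, b in zip(words, words[1:]):
        table.setdefault(a, []).append(b)
    return table

def generate(table, start, length=10, seed=0):
    """Walk the chain from `start`, picking a random recorded successor each step."""
    rng = random.Random(seed)
    out = [start]
    for _ in range(length - 1):
        followers = table.get(out[-1])
        if not followers:  # dead end: the last word never appeared mid-corpus
            break
        out.append(rng.choice(followers))
    return " ".join(out)

corpus = "the cat sat on the mat and the cat ran"
table = train_bigrams(corpus)
print(generate(table, "the"))
```

Such a model has no parameters to speak of beyond the table itself, which is part of why it's an interesting point of comparison against a 25K-parameter transformer.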
Eliza called, and asked if we saw her grandkids...
What makes you say that? This is about you, not me.
(Came here to say an update to Eliza could really mess with the last person still talking to her.)
i hate ai, and i love the c64, but i'll allow it.