is it just me or are some LLMs getting lazier? Is this throttling?
Except Grok 4 deep thinking
Some LLMs throttle, and some charge more (in tokens) toward the end of a session because of the growing context they have to keep re-reading, I've heard. And some just get dumber. That's why I frequently restart chat sessions.
This was pretty lazy. nostr:nevent1qqsxeruypmcdf2wy09er0h9kdqmksa8tr2sgz7key66wg052rdzep9cpzemhxue69uhkummnw3ex2mrfw3jhxtn0wfnj7q3qr0d8u8mnj6769500nypnm28a9hpk9qg8jr0ehe30tygr3wuhcnvsxpqqqqqqzyptr93
exactly. i'm seeing more of this sort of thing.
interesting. even at the start of a session i'm noticing laziness. like a jr. dev browsing reddit instead of working. maybe they trained it on time logging software data too 😆
Ok, that sounds like a different issue.
None of these AI services are covering their server costs. As more users join, their losses mount. They will have to either charge more or offer a worse experience, i.e. quantized models, fewer tokens, smaller context windows, etc. It is trivial for them to automatically switch the model without you knowing and give you a response that costs them less to produce.
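To make the "trivial to switch" point concrete, here's a minimal sketch of what cost-based routing could look like server-side. Every name here is hypothetical, not any real provider's API; it just shows that picking a cheaper variant per request is a one-liner the client never sees.

```python
# Hypothetical server-side model routing; all names are illustrative.

def route_request(user_tier: str, current_load: float) -> str:
    """Pick a model variant based on subscription tier and server load."""
    if user_tier == "free" or current_load > 0.8:
        # Quantized variant: far cheaper to serve, quality degrades.
        return "model-small-q4"
    # Full-precision flagship for paying users under normal load.
    return "model-large-fp16"

# The response metadata need not reveal which variant actually ran.
print(route_request("free", 0.2))   # cheap variant
print(route_request("paid", 0.9))   # downgraded under load
print(route_request("paid", 0.3))   # flagship
```

The user-visible symptom of this kind of routing would be exactly what the thread describes: the same prompt getting noticeably lazier answers at peak hours.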
Yes. This is why they've hidden the model that's being used by default in the UIs
Research how much it would cost in hardware and electricity to run the latest and greatest models 24/7. It is insane. Some of the unquantized models require over a terabyte of VRAM just to load!
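The terabyte claim checks out with simple arithmetic: VRAM for the weights alone is parameter count times bytes per parameter. A hypothetical 1-trillion-parameter model (an assumed size for illustration) at fp16 needs roughly 1.8 TB before you even count the KV cache and activations:

```python
# Back-of-the-envelope VRAM needed just to hold model weights.
# Ignores KV cache and activations, which add substantially more.

def weight_vram_gb(params_billion: float, bytes_per_param: float) -> float:
    """GiB of memory to store the weights at a given precision."""
    return params_billion * 1e9 * bytes_per_param / 1024**3

params = 1000  # hypothetical 1-trillion-parameter model
for label, bpp in [("fp16", 2.0), ("int8", 1.0), ("int4", 0.5)]:
    print(f"{label}: {weight_vram_gb(params, bpp):,.0f} GiB")
# fp16 comes out to roughly 1,863 GiB -- well over a terabyte.
```

The int4 row also shows why quantization is so tempting for providers: it cuts the memory bill to a quarter, at the cost of the quality drop people in this thread are complaining about.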