jack's avatar
jack 11 months ago
deepseek gives nostr clients a huge advantage…if they choose to use it

Replies (50)

Default avatar
S 11 months ago
It is if you seek deep in your 🫶
Market close: $NVDA: -16.91% | $AAPL: +3.21% Why is DeepSeek great for Apple? Here's a breakdown of the chips that can run DeepSeek V3 and R1 on the market now: NVIDIA H100: 80GB @ 3TB/s, $25,000, $312.50 per GB AMD MI300X: 192GB @ 5.3TB/s, $20,000, $104.17 per GB Apple M2 Ultra: 192GB @ 800GB/s, $5,000, $26.04(!!) per GB Apple's M2 Ultra (released in June 2023) is 4x more cost efficient per unit of memory than AMD MI300X and 12x more cost efficient than NVIDIA H100! Why is this relevant to DeepSeek? DeepSeek V3/R1 are MoE models with 671B total parameters, but only 37B are active each time a token is generated. We don't know exactly which 37B will be active when we generate a token, so they all need to be ready in high-speed GPU memory. We can't use normal system RAM because it's too slow to load the 37B active parameters (we'd get <1 tok/sec). On the other hand GPUs have fast memory but GPU memory is expensive. Apple Silicon, however, uses Unified Memory and UltraFusion to fuse dies - a tradeoff that favors a large amount of medium-fast memory at a cheaper cost. Unified memory shares a single pool of memory between the CPU and GPU rather than having separate memory for each. There's no need to have separate memory and copy data between the CPU and GPU. UltraFusion is Apple's proprietary interconnect technology for connecting two dies with a super high speed, low latency connection (2.5TB/s). Apple's M2 Ultra is literally two Apple M2 Max dies fused together with UltraFusion. This is what enables Apple to achieve such a high amount of memory (192GB) and memory-bandwidth (800GB/s). Apple M4 Ultra is rumored to use the same UltraFusion technology to fuse together two M4 Max dies. This would give the M4 Ultra 256GB(!!) of unified memory @ 1146GB/s. Two of these could run DeepSeek V3/R1 (4-bit) at 57 tok/sec. All of this and Apple has managed to package this in a small form-factor for consumers with great power efficiency and great open-source (uncharacteristic of Apple!) software. MLX has made it possible to leverage Apple Silicon for ML workloads and exolabs has made it possible to cluster together multiple Apple Silicon devices to run large models, demonstrating DeepSeek R1 (671B) running on 7 M4 Mac Minis. It's unclear who will build the best AI models, but it seems likely that AI will run on American hardware, on Apple Silicon. image
I would love to see more Nostr+AI. Having something like DeepSeek on Nostr will be very useful. I became used to use Grok on X and it will be awesome to have our open version of it here
B 's avatar
B 11 months ago
I’ve given up trying to read any serious articles tonight. Too funny 😂 Time for sleep. At this rate I’ll be giggling in my dreams!
B 's avatar
B 11 months ago
Just tried to close my eyes and my brain took me straight to the “And don’t forget the gravy” cartoon. Super old cartoon the steak emoji has now linked to! 🙄 Oh God my dreams tonight!!!!
Default avatar
Anonpleb 11 months ago
If ChatGPT collects all tge information from users and is able to put together profiles on their users, I assume Deepseek does as well. Is giving China all that data a wise thing to do as an individual?
Default avatar
npub1e3fl...3mjr 11 months ago
Ah yes, because nostr has a wide broad range of content that has nothing to do with bitcoin
21seasons's avatar
21seasons 11 months ago
At least it's better than giving your data to US or EU
Analogue Dog's avatar
Analogue Dog 11 months ago
Becuause users have not given permission for their data to be ingested and/or used by AI.
Default avatar
Rand 11 months ago
blob/Blob & i live with A BOB*/)_____lolz*
Default avatar
Rand 11 months ago
blah, blah/BLOB n_n*/
SinedinZigan's avatar
SinedinZigan 11 months ago
Keep AI out of here and let it be a last place of heaven.
Default avatar
npub17fnq...j4zs 11 months ago
Some examples off the top of my head, given a self-hosted Deepseek model: - provide a summary of notes posted to your feed over the last X hours - get notes about a specific topic - generate a note about a topic, link or attachment - language translation - zap all notes based on some rule. “Zap everyone who’s shitting on Sam Altman 21 sats”
Default avatar
kpr797 10 months ago
Run it locally on your computer instead.