Overview
Explore
Trending
Nostr Archives
Overview
Explore
Trending
2
2dcad9…6ab387
13d ago
i've experimented a bit, lm studio, vllm, on a ryzen 5 ai + 96GB system ram, the problem isn't really the inference, it's the prefill that's super slow
💬 0 replies
❤️
0
Reactions
🔁
0
Reposts
⚡
0
Zaps
Replies (0)
No replies yet.