Nostr Archives
22dcad9…6ab38713d ago
I've experimented a bit with LM Studio and vLLM on a Ryzen 5 AI with 96 GB of system RAM. The problem isn't really the inference, it's the prefill that's super slow.
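One way to put numbers on that split is to record three timestamps per request (request sent, first token received, last token received) and compute prefill and decode rates separately. This is a rough sketch, not something from the post: the function name `split_timings` and the sample numbers are made up for illustration and are not measurements from this system.

```python
def split_timings(t_start, t_first_token, t_end, prompt_tokens, generated_tokens):
    """Separate prefill from decode using three wall-clock timestamps (seconds).

    Hypothetical helper: prefill is approximated as time-to-first-token,
    decode as everything after the first token arrives.
    """
    prefill_s = t_first_token - t_start
    decode_s = t_end - t_first_token
    return {
        "prefill_s": prefill_s,
        "prefill_tok_per_s": prompt_tokens / prefill_s,
        "decode_s": decode_s,
        # The first generated token is produced by the prefill pass,
        # so the decode phase only accounts for the remaining tokens.
        "decode_tok_per_s": (generated_tokens - 1) / decode_s,
    }


# Illustrative numbers only: an 8000-token prompt, first token after 40 s,
# then 200 more tokens over the following 20 s.
stats = split_timings(0.0, 40.0, 60.0, prompt_tokens=8000, generated_tokens=201)
for name, value in stats.items():
    print(f"{name}: {value:.1f}")
```

With a streaming client, `t_first_token` is just the time the first chunk arrives, so the same arithmetic works for any backend that streams tokens.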