I see this is for smaller models. Can I also use it for ~100B-parameter LLMs? I'd prefer to do this locally if I can; I do have access to the hardware for it.
