That was the first one I got working, it's been a couple years. Its a project called subsai. It's meant to generate subtitles for videos using whisper. But once I got it working I'm not really using most of its features just using batch files to control whisper. I have not tried to connect it...
In my experience speed is still better only doing a partial cpu offload and getting as many layers on the gpu as possible. But again it's very old cpus. I'm primarily running gemma-3-27b-it-qat-q4_0 as it fits entirely in the gpu unless I need to turn the context way up for a large prompt...
It's been working fine, I'm able to run fairly sizable models at reasonable speeds. Still on 6 inch 8x riser. I tried several x16 risers none work. Never figured out why. It's in a very old server maybe that has something to do with it. Its a dell r720. I chose that because I already had it...
Just joined. I hope I'm ok to post here. I bought a 32gb sxm2 v100 and one if them 60 dollar adapter boards. It came in today. I've been pulling my hair out all night trying to get it to work. Thinking did I break it, am I an idiot. I'm a bit of a tech hoarder so I've had it in several...
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.