• 1 Post
  • 52 Comments
Joined 1 year ago
Cake day: June 15th, 2023




  • A lot of what you said is true.

    Since the TPU is a matrix processor instead of a general purpose processor, it removes the memory access problem that slows down GPUs and CPUs and requires them to use more processing power.

    Just no. Flat out no. Just so much wrong. How does the TPU process data? How does the data get there? It needs to be shuttled back and forth over the bus. Doing this for a 1080p image’s worth of data several times a second is fine. An uncompressed 1080p image is about 8MB. Entirely manageable.

    Edit: it’s not even 1080p, because the image would get resized to the model’s input size. So again, 300x300x3 for the last model I could find.

    /Edit
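
    To put rough numbers on that (uncompressed, one byte per colour channel; the ~8MB figure corresponds to a 4-channel buffer):

    # Back-of-the-envelope only: uncompressed frame sizes, 1 byte per channel
    full_hd_rgb = 1920 * 1080 * 3   # ~6.2 MB (~8.3 MB with a 4th channel)
    coral_input = 300 * 300 * 3     # ~0.27 MB once resized to the model input
    print(full_hd_rgb / 1e6, coral_input / 1e6)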

    Look at this repo. You need to convert the models using the TFLite framework (TensorFlow Lite), which is designed for resource-constrained edge devices. The max input size is 224x224x3. I would imagine it can’t handle anything larger.

    https://github.com/jveitchmichaelis/edgetpu-yolo/tree/main/data
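
    If you’re wondering what that conversion step looks like, here’s a minimal sketch using the standard TensorFlow APIs (the model path is made up, and a real Coral deployment also needs full-integer quantization plus a pass through the edgetpu_compiler tool):

    # Minimal sketch: convert a TensorFlow SavedModel to TFLite.
    # The path below is hypothetical; Edge TPU deployment additionally requires
    # full-integer quantization and compiling the result with edgetpu_compiler.
    import tensorflow as tf

    converter = tf.lite.TFLiteConverter.from_saved_model("my_detector_savedmodel")
    converter.optimizations = [tf.lite.Optimize.DEFAULT]  # enable quantization
    tflite_model = converter.convert()

    with open("my_detector.tflite", "wb") as f:
        f.write(tflite_model)

    # Then, on the command line: edgetpu_compiler my_detector.tflite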

    Now look at the official model zoo on the Google Coral website.

    https://coral.ai/models/

    Not a single model is larger than 40MB, whereas LLMs start at well over a gig even for smaller (and inaccurate) models. The good ones start at about 4GB, and I frequently run models at about 20GB. The size in parameters really makes a huge difference.
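
    For a rough sense of why parameter count dominates (generic assumed numbers, not measurements of any specific model):

    # Rough file-size estimate: bytes ≈ parameters × bytes per parameter
    params = 7e9                              # a typical "small" LLM
    print(params * 2 / 1e9, "GB at fp16")     # ~14 GB
    print(params * 0.5 / 1e9, "GB at 4-bit")  # ~3.5 GB, roughly that 4GB ballpark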

    You likely/technically could run an LLM on a Coral, but you’re going to wait on the order of double-digit minutes for a basic response, if not way longer.

    It’s just not going to happen.


  • when comparing apples to apples.

    But this isn’t really easy to do, and it’s impossible in some cases.

    Historically, Nvidia has done better than AMD in gaming performance because there are so many game-specific optimizations in the Nvidia drivers, whereas AMD’s drivers didn’t have them.

    On the other hand, AMD historically had better raw performance in scientific computing tasks (pre-deep-learning trend).

    Nvidia has had a stranglehold on the AI market entirely because of their CUDA dominance. But hopefully AMD has finally bucked that trend with their new ROCm release that is a drop-in replacement for CUDA (meaning you can just run CUDA-compiled applications on AMD with no changes).

    Also, AMD’s new MI300X AI processor is (supposedly) wiping the floor with Nvidia’s H100 cards. I say “supposedly” because I don’t have $50k USD to buy both cards and compare myself.




  • And you can add as many TPUs as you want to push it to whatever level you want

    No you can’t. You’re going to be limited by the number of PCIe lanes. But putting that aside, those Coral TPUs don’t have any memory, which means for each operation you need to shuffle the relevant data over the bus to the device for processing, and then back again. You’re going to be doing this thousands of times per second (likely much more), and I can tell you from personal experience that running AI like that is painfully slow (if you can even get it to work that way in the first place).
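
    Some very rough numbers to illustrate (the link speed is an assumption about a single PCIe 3.0 lane, not a measured Coral figure):

    # Back-of-the-envelope: time just to stream weights over the bus, no compute
    model_bytes = 4e9           # a ~4 GB set of LLM weights
    lane_bytes_per_s = 0.985e9  # ~1 GB/s usable on one PCIe 3.0 lane (assumed)
    print(model_bytes / lane_bytes_per_s, "seconds per full pass over the weights")
    # ~4 s per pass, and generating each token needs a pass over the weights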

    You’re talking about the equivalent of buying hundreds of dollars of groceries and then getting everything home, 10km away, on foot with whatever you can fit in your pockets, making multiple trips.

    What you’re suggesting can’t work.


  • ATI cards (while pretty good) are always a step behind Nvidia.

    Ok, you mean AMD. They bought ATI like 20 years ago now and that branding is long dead.

    And AMD cards are hardly “a step behind” Nvidia. This is only true if you buy the 24GB top card of the series. Otherwise you’ll get comparable performance from AMD at a better value.

    Plus, most distros have them working out of the box.

    Unless you’re running a kernel older than 6.x, every distro will support AMD cards. And even then, you could always install the proprietary blobs from AMD and get full support on any distro. The kernel version only matters if you want to use the FOSS kernel drivers for the cards.


  • Two* GPUs? Is that a thing? How does that work on a desktop?

    I’ve been using two GPUs in a desktop for 15 years, one AMD and one Nvidia (although not lately).

    It really works just the same as a single GPU. The system doesn’t really care how many you have plugged in.

    The only difference you have to care about is specifying which GPU you want a program to use.

    For example, if you had multiple Nvidia GPUs you could specify which one to use from the command line with:

    CUDA_VISIBLE_DEVICES=0

    or the first two with:

    CUDA_VISIBLE_DEVICES=0,1

    Anyways, you get the idea. It’s a thing that people do and it’s fairly simple.
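
    And if it helps, here’s the same idea from inside a script, a minimal sketch assuming a CUDA-based framework like PyTorch:

    # Minimal sketch: set the mask before the CUDA runtime initializes,
    # i.e. before importing any GPU framework.
    import os
    os.environ["CUDA_VISIBLE_DEVICES"] = "0,1"  # expose only the first two GPUs

    import torch  # example framework; anything CUDA-based respects the mask
    print(torch.cuda.device_count())  # reports 2, assuming the machine has 2+ GPUs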









  • CeeBee@lemmy.world (OP) to Selfhosted@lemmy.world · OpenSubtitles Hostility · 9 months ago

    I know how debugging works. I’ve been a developer for a couple decades.

    I know for a fact that the lines I removed are normal verbose messages and entirely unrelated to my issue. I know not only because I’m a developer and understand the messages, but also because those lines show up every second of every minute of every day. They are some of the most verbose lines in the logs. The scheduled task for the subtitles only runs once a day and finishes within a few minutes.

    Also, they weren’t indicative of any code path because of how frequent they were. At such a high frequency it becomes impossible to determine which line came first in multi-threaded or asynchronous tasks.


  • I literally have a pinned tab for a Whisper implementation on GitHub! It’s definitely on my radar to check out. My only concerns are how well it handles multiple speakers, and whether it can generate SDH subtitles. That’s the type with those extra bits like “Suspenseful music”, “[groans]”, “[screams]”, etc. All the stuff someone hard of hearing would benefit from.