Hacker News

Again, memory bandwidth is pretty much all that matters here. During inference or training, the CUDA cores of retail GPUs sit at something like 15% utilization.
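The bandwidth-bound claim comes from the fact that generating each token requires streaming the full model weights from memory. A minimal back-of-envelope sketch, with illustrative (not measured) numbers for model size and bandwidth:

```python
# Back-of-envelope: token generation (decode) is memory-bandwidth-bound
# because every decoded token must stream the full set of model weights.
# All figures below are illustrative assumptions, not measurements.

def decode_tokens_per_sec(weight_bytes: float, bandwidth_bytes_per_sec: float) -> float:
    """Upper bound on decode speed: one full weight read per token."""
    return bandwidth_bytes_per_sec / weight_bytes

# Assumed example: a 7B-parameter model at 8-bit quantization (~7 GB of
# weights) on a GPU with ~1 TB/s memory bandwidth (roughly RTX 4090 class).
weights = 7e9        # bytes
bandwidth = 1.0e12   # bytes/sec

print(f"~{decode_tokens_per_sec(weights, bandwidth):.0f} tokens/sec upper bound")
# → ~143 tokens/sec upper bound
```

At that limit the compute units spend most of each step waiting on memory, which is consistent with the low CUDA-core utilization mentioned above.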




Not for prompt processing. Current Macs are really not great at long contexts.
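Prompt processing (prefill) is the exception because it is compute-bound: the weights are read once per batch of prompt tokens, while the FLOP count scales with prompt length. A rough sketch under assumed numbers (the sustained-throughput figure is hypothetical, not a measured Apple-silicon spec):

```python
# Why prefill stresses compute rather than bandwidth: processing N prompt
# tokens in a dense transformer takes roughly 2 * params * N FLOPs, while
# the weights are streamed once per batch of tokens, not once per token.
# All numbers are illustrative assumptions.

def prefill_seconds(params: float, prompt_tokens: int, flops_per_sec: float) -> float:
    """Rough lower bound on prefill time for a dense transformer."""
    return 2 * params * prompt_tokens / flops_per_sec

# Assumed: 7B params, a 32k-token prompt, ~30 TFLOP/s sustained
# (hypothetical figure for an Apple-silicon-class GPU).
t = prefill_seconds(7e9, 32_000, 30e12)
print(f"~{t:.0f} s to prefill")
# → ~15 s to prefill
```

With a comparatively low compute ceiling, long prompts take many seconds before the first token appears, even when decode speed is fine.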


