There's this post and thread from 7 weeks ago: https://news.ycombinator.com/item?id=46319657
I do! I have an M3 Ultra with 512GB. A couple of opencode sessions running at once work well. Currently running GLM 4.7 but was on Kimi K2.5. Both great. Excited for more efficiencies to make their way to LLMs in general.
The prompt processing times I've heard about have put me off going that high on memory with the M series (hoping that changes with the M5 series, though). What are the average and longest times you've had to wait when using opencode? Have any improvements to MLX helped in that regard?
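If it helps compare notes, this is roughly what I'd run to get numbers: a minimal sketch assuming the mlx-lm Python package, with a placeholder model name standing in for whatever MLX-quantized model you actually use. With verbose output it reports prefill (prompt processing) and generation tokens/sec separately, and prefill on long prompts is where the pain usually shows up.

    # Rough prefill vs. generation speed check (sketch; assumes `pip install mlx-lm`)
    from mlx_lm import load, generate

    # Placeholder repo id -- substitute the model you actually run
    model, tokenizer = load("mlx-community/your-model-4bit")

    # Long prompt so prompt processing (prefill) dominates the first measurement
    prompt = "Summarize the following notes:\n" + ("lorem ipsum " * 2000)

    # verbose=True prints prompt tokens/sec and generation tokens/sec
    generate(model, tokenizer, prompt=prompt, max_tokens=100, verbose=True)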
How's the inference speed? What was the price? I'm guessing you can fit the entire model without quantization?
Excellent. Thanks for the info!
There are some people on r/LocalLlama using it [0]. The consensus seems to be that while it does have more unified RAM for running models (up to half a terabyte), token generation can be slow enough that it might just be better to get an Nvidia or AMD machine.
[0] https://old.reddit.com/r/LocalLLaMA/search?q=mac+studio&rest...
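To put the "slow" in perspective: decode is roughly memory-bandwidth-bound, so you can estimate a tokens/sec ceiling as bandwidth divided by the bytes of active weights read per token. A back-of-the-envelope sketch (the bandwidth and parameter figures below are illustrative assumptions, not benchmarks):

    # Decode is ~memory-bandwidth-bound: tokens/sec ceiling ~ bandwidth / bytes per token.
    # All numbers here are illustrative assumptions, not measurements.
    def rough_tokens_per_sec(bandwidth_gb_s, active_params_billions, bytes_per_param):
        bytes_per_token = active_params_billions * 1e9 * bytes_per_param
        return bandwidth_gb_s * 1e9 / bytes_per_token

    # Dense ~70B model at 4-bit (~0.5 bytes/param) on ~800 GB/s unified memory
    print(rough_tokens_per_sec(800, 70, 0.5))   # ~23 tok/s ceiling

    # Large MoE with only ~30B active params per token
    print(rough_tokens_per_sec(800, 30, 0.5))   # ~53 tok/s ceiling

Real numbers land below these ceilings, but it's part of why MoE models are the ones people tend to run on this hardware.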
Thanks for the link. I'll take a look.
Not a Mac Studio, but I use a basic MacBook Pro with 24 GB of RAM (16 GB usable as VRAM) and I can run a number of models on it at decent speed. My main bottleneck is context window size, but for single-purpose questions I'm fine.
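For what it's worth, my understanding is that the ~16 GB figure is a default macOS cap on GPU-wired memory rather than a hard hardware split. A quick way to inspect it (a sketch assuming the iogpu.wired_limit_mb sysctl on recent macOS; older releases exposed a debug.iogpu variant instead):

    # Inspect the GPU wired-memory cap on Apple Silicon (sketch; the sysctl name
    # is an assumption based on recent macOS versions)
    import subprocess

    out = subprocess.run(["sysctl", "iogpu.wired_limit_mb"], capture_output=True, text=True)
    print(out.stdout.strip() or out.stderr.strip())
    # 0 typically means "use the OS default cap", not "no GPU memory"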
Yeah. I'm currently on a Mac Mini M2 Pro with 32 GB of RAM, and I've been curious how much more I could get out of the Apple ecosystem. Thanks for your perspective.
What models are you running?
Mine is an M1 Ultra with 128 GB of RAM. It's fast enough for me.
Thanks for the perspective!