Fine-tuning for specific tasks. I'm hoping to see some good examples of that soon - the blog entry mentions things like structured text extraction, so maybe something like "turn this text about an event into an iCal document" might work?
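Something like this, maybe? A minimal sketch, where llama-cpp-python, the model file, and the prompt/JSON format are all my assumptions rather than anything from the blog entry:

```python
# Hypothetical sketch: use a small fine-tuned local model to pull event
# fields out of free text, then render them as an iCal VEVENT.
# llama-cpp-python and the model path are assumptions, not from the post.
import json
from llama_cpp import Llama

llm = Llama(model_path="./event-extractor.gguf")  # hypothetical fine-tune

PROMPT = (
    "Extract the event from the text as JSON with keys "
    '"summary", "start", "end" (ISO 8601 UTC, e.g. 20250301T180000Z):\n'
    "Text: {text}\nJSON:"
)

def text_to_ical(text: str) -> str:
    out = llm(PROMPT.format(text=text), max_tokens=128, stop=["\n"])
    ev = json.loads(out["choices"][0]["text"])
    return "\n".join([
        "BEGIN:VCALENDAR",
        "VERSION:2.0",
        "BEGIN:VEVENT",
        f"SUMMARY:{ev['summary']}",
        f"DTSTART:{ev['start']}",
        f"DTEND:{ev['end']}",
        "END:VEVENT",
        "END:VCALENDAR",
    ])

print(text_to_ical("Team standup next Monday at 9am for half an hour"))
```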
Fine-tuning messes with instruction following and RL'd behavior. I think this is mostly going to be useful for high-volume pipelines doing some sort of mundane extraction or transformation.
I feel like the blog post, and the GP comment, do a good job of explaining how it's built to be a small model easily fine-tuned for narrow tasks, rather than used for general tasks out of the box. The latter is guaranteed to hallucinate heavily at this size, but that doesn't mean it would on every specific task it's fine-tuned for. Some examples given were fine-tuning it to quickly and efficiently route a query to the right place to actually be handled, or tuning it to do sentiment analysis of content.
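The routing case might look roughly like this (a minimal sketch assuming a transformers text-classification fine-tune; the model path, labels, and handlers are placeholders I made up):

```python
# Hypothetical sketch of the "route a query" use case: a tiny fine-tuned
# classifier picks a destination, and heavier systems only run for the
# queries that need them. Model path and labels are placeholders.
from transformers import pipeline

# Placeholder handlers; in practice these would be real backends.
def handle_billing(q): return f"billing queue <- {q}"
def handle_support(q): return f"support queue <- {q}"
def handle_general(q): return f"general queue <- {q}"

ROUTES = {"BILLING": handle_billing, "SUPPORT": handle_support,
          "GENERAL": handle_general}

router = pipeline("text-classification", model="./my-tiny-router")

def route(query: str) -> str:
    label = router(query)[0]["label"]          # e.g. "BILLING"
    return ROUTES.get(label, handle_general)(query)

print(route("Why was my card charged twice?"))
```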
An easily fine-tunable tiny model might actually be one of the better uses of local LLMs I've seen yet. Rather than try to be a small model that's great at everything, it's a tiny model you can quickly tune to do one specific thing decently, extremely fast, and locally on pretty much anything.
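For anyone curious what "quickly tune" looks like in practice, here's a rough LoRA sketch with Hugging Face peft. The base model (pythia-70m as a stand-in tiny model), toy data, and hyperparameters are all my assumptions, not anything from the announcement:

```python
# Hypothetical sketch: LoRA fine-tune of a small causal LM with peft.
# Base model, dataset, and hyperparameters are stand-ins.
from datasets import Dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

base = "EleutherAI/pythia-70m"  # stand-in; swap in whatever tiny model you use
tok = AutoTokenizer.from_pretrained(base)
tok.pad_token = tok.eos_token   # pythia has no pad token by default
model = AutoModelForCausalLM.from_pretrained(base)
model = get_peft_model(model, LoraConfig(r=8, lora_alpha=16,
                                         task_type="CAUSAL_LM"))

# Toy task data: input -> label pairs flattened into plain text.
examples = [{"text": "Review: great battery life\nSentiment: positive"},
            {"text": "Review: screen died in a week\nSentiment: negative"}]
ds = Dataset.from_list(examples).map(
    lambda e: tok(e["text"], truncation=True, max_length=128))

Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", num_train_epochs=3,
                           per_device_train_batch_size=2),
    train_dataset=ds,
    data_collator=DataCollatorForLanguageModeling(tok, mlm=False),
).train()
```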
This sounds like bro science. Having boring sessions is not the point; Z2 training is designed to build endurance, improve cardiovascular efficiency, and increase your body's ability to burn fat as a fuel source at moderate intensities. It’s not about enduring boredom or embracing “mental pain” but rather about consistently training at a level that is sustainable for extended periods.
I think they are a good middle ground, but you're still left with some of the busy work. Further, you're a bit more at the mercy of the maintainer. LazyVim likely isn't going anywhere, but its disappearing isn't out of the realm of possibility either.
I get frustrated seeing this go into the iPad and knowing that we can't get a shell and run our own binaries there. Not even as a VM like [UserLAnd](https://userland.tech). I could effectively travel with one less device in my backpack, but instead I have to carry two M chips, two displays, batteries, and so on...
It's great to see this tech moving forward, but it's frustrating not to see it translate into a more significant impact on the ways we work, travel, and develop software.
> A leaker has claimed that Apple is working on a version of macOS exclusive for the M2 iPad Pro ... the exclusivity to M2 iPad Pro could be a marketing push. If the feature is only available on that iPad, more people would buy it.
Based on the M4 announcement, vMacOS could be exclusive to the 1TB/2TB iPad Pro with 16GB RAM, which would be helpful for VMs.
gpu passthrough for VMs is not supported on apple silicon, period, afaik. there may be some "native" renderer built on top of metal, but apple doesn't support SR-IOV or "headless passthrough".
otoh no, it is not "more or less [automatic]" on other hardware either. SR-IOV has been on the enthusiast wishlist for a ridiculously long time now because basically nobody implements it (or they restrict it to the most datacenter-y of products).
intel iGPUs from the HD/UHD Intel Graphics Technology era have a concept called GVT-g which isn't quite SR-IOV but generally does the thing. Newer Xe-based iGPUs do not support this, nor do the discrete graphics cards.
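For reference, this is roughly how GVT-g worked through the mdev sysfs interface, if I remember it right (kernel booted with i915.enable_gvt=1 and the kvmgt module loaded, run as root; the PCI address and type names vary by machine):

```python
# Rough sketch of how GVT-g exposed a mediated vGPU through sysfs on
# HD/UHD-era iGPUs. The PCI address and type name below vary per machine.
import os, uuid

igpu = "/sys/bus/pci/devices/0000:00:02.0"
types_dir = os.path.join(igpu, "mdev_supported_types")

# List the vGPU "sizes" the driver offers, e.g. i915-GVTg_V5_4.
for t in os.listdir(types_dir):
    with open(os.path.join(types_dir, t, "available_instances")) as f:
        print(t, "instances left:", f.read().strip())

# Create one mdev instance; the resulting UUID is what you hand to
# QEMU/libvirt as a vfio-mdev device.
vgpu = str(uuid.uuid4())
chosen = "i915-GVTg_V5_4"  # example type name
with open(os.path.join(types_dir, chosen, "create"), "w") as f:
    f.write(vgpu)
print("created vGPU", vgpu)
```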
AMD's iGPUs do not have anything at all afaik. Their dGPUs don't even implement reset properly, which is becoming a big problem for people trying to set up GPU clouds for AI stuff - a lot of the time the AMD machines will need a hard power reset to come back.
NVIDIA GPUs do reset properly, and do implement SR-IOV properly... but they only started letting you do passthrough recently, and only 1 VM instance per card (so, 1 real + 1 virtual).
Curious what you're using (I'm guessing intel iGPU or nvidia dGPU), but generally this is still something that gets Wendell from Level1Techs hot and bothered at the mere possibility of this feature showing up in something without a five-figure subscription attached.
It does suck that Apple refuses to implement Vulkan support (or sign graphics drivers); I think that's de facto how people interact with most "hardware accelerated graphics" solutions in VMware or VirtualBox. But SR-IOV is actually quite a rare feature, and "passthrough" is not sufficient here since the host machine still needs to use the GPU as well. The key feature is SR-IOV, not just passthrough.
This is not an embedding model, though. Yes, you can always extract some embeddings from somewhere, but for most LLMs those won't perform well for retrieval (which makes sense, as it's not what the models are optimizing for).
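To make "extract some embeddings from somewhere" concrete, here's a quick sketch of mean-pooling a causal LM's hidden states. The model name is just a stand-in, and vectors obtained this way typically retrieve far worse than a purpose-trained embedding model:

```python
# Sketch of pulling crude embeddings out of an ordinary causal LM by
# mean-pooling its last hidden states. Runs, but usually retrieves much
# worse than a trained embedding model. Model name is a stand-in.
import torch
from transformers import AutoModel, AutoTokenizer

name = "EleutherAI/pythia-70m"  # stand-in; not an embedding model
tok = AutoTokenizer.from_pretrained(name)
model = AutoModel.from_pretrained(name)

def embed(text: str) -> torch.Tensor:
    inputs = tok(text, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state  # (1, seq, dim)
    mask = inputs["attention_mask"].unsqueeze(-1)
    return (hidden * mask).sum(1) / mask.sum(1)     # masked mean pool

a, b = embed("how do I reset my password"), embed("password recovery steps")
print(torch.cosine_similarity(a, b).item())
```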
This isn't an embedding model, but it is a group of people working in this general area in a language other than English. Maybe they'll get to an embedding model next?