agentifysh · 2025-12-16T00:19:53 1765844393

any recommendations for non-Chinese products?

agentifysh · 2025-12-16T00:16:27 1765844187

Time for a class action lawsuit. You can submit your personal information through the upload forms of a WordPress-powered law firm in exchange for your twenty bucks, not adjusted for inflation, in about 5 years, while they collect a cool 50% fee distributed amongst millionaire lawyers.

hackthemack · 2025-12-16T00:42:50 1765845770

Something is seriously wrong with the US justice system. Some links to bolster your point.

https://waldenconsultants.com/2020/04/13/yet-another-study-s...

https://en.wikipedia.org/wiki/High-Tech_Employee_Antitrust_L...

xrd · 2025-12-16T01:14:20 1765847660

I wish I could do the reverse. Could I and a million other people pay $20 now to a few law firms that could fight this without need for compensation and do everything to expose this to everyone in America?

gruez · 2025-12-16T01:48:39 1765849719

Isn't that what organizations like the ACLU are for? Except ACLU fights for civil rights whereas your hypothetical organization fights for consumer rights. The reason why it doesn't exist is that it suffers heavily from the free rider problem. Any individual's donation of $20 or whatever is unlikely to get them $20 worth of returns, because the lawsuit is either funded or not. Moreover you'd benefit regardless of whether you donated or not, so there's no incentive to donate.

citizenkeen · 2025-12-16T02:39:38 1765852778

But isn’t that true of the ACLU as well?

frumenty · 2025-12-16T01:50:04 1765849804

A similar idea that's immediately actionable is subscribing to independent media doing investigative journalism

agentifysh · 2025-12-16T01:46:29 1765849589

hmmm that is very interesting, wonder if it's even possible

agentifysh · 2025-12-15T21:21:44 1765833704

not really projects but productivity tools for codex/claude code etc

https://github.com/agentify-sh/safeexec/ - this will prevent rm -rf or git reset --hard from being run by accident when used with the --dangerously-bypass-permissions flag

https://github.com/agentify-sh/10x/ - codex productivity booster which adds skills (most likely need to be phased out now since codex has skills), redundant checkpoint/backups with git and jj, subagents, parallel agents, PLAN/RUN/THINK modes.... im actually not sure about releasing this anymore because of how much better codex has gotten.

agentifysh · 2025-12-15T21:12:52 1765833172

we have so many failure-as-a-feature ops these days im surprised we aren't discussing it more. something that consistently happens with enough frequency without any repercussions ultimately just becomes a feature of its own.

the data breaches we consistently have in institutions we trust are converging to a point where it's literally just a data harvesting op and everybody stops caring. They won't even bother to join class action lawsuits anymore because the rewards enrich the lawyers while everybody gets their twenty bucks in the mail after providing more personal data to the law firm. it's like a loophole.

we now have legalized insider trading in the form of "prediction markets", legalized money laundering and pump and dump through crypto, all of these always lead to failures for the participant disguised as wins.

agentifysh · 2025-12-15T20:56:18 1765832178

This article is the first time I am hearing about it

agentifysh · 2025-12-15T20:51:43 1765831903

I'm curious why a Canadian is so hell bent on causing more division in America by embedding his political views in an otherwise decent vulnerability analysis.

He makes it sound like he's on some sort of a mission...like the users of the messaging app ( which I had never heard of until today ) should face some sort of backlash for holding political views opposite to his....which is amusing to say the least as Canadians seem to have permanently marked conservatives, not just in their own country but all over the world, as "MAGA".

also I'd appreciate if we can keep politics out, which just detracts from the technical end of things

verdverm · 2025-12-15T21:28:52 1765834132

> I'd appreciate if we can keep politics out

This is an app specifically built for a specific political group, a group that is wreaking havoc on our science and technology. "MAGA" has become the go-to term for a global movement, because there is a global alt-right movement to undo progress and dominate others into their world view.

It's going to be a part of HN like it was the first go around. Being apolitical is how political groups like this come to power.

agentifysh · 2025-12-15T22:02:26 1765836146

same argument can be made for bluesky or reddit, pretty much any platform you slap political labels on, and this only increases division and radicalizes people on the fringes who are desperate for a sense of belonging as a surrogate for loneliness

verdverm · 2025-12-15T22:10:01 1765836601

Do you want the alt-right to take over? If your answer is no, then understand we need to talk about it all the time to fight back.

They want us to _not talk_ about what they are doing so we _remain ignorant_ of what each other think about what they are doing, so they can get away with more

agentifysh · 2025-12-16T00:13:03 1765843983

No but do you want the alt-left to take over? I'm for neither side and im tired of the constant ideological battles

verdverm · 2025-12-16T00:31:56 1765845116

We need to talk about both of them, not neither

You want constant ideological battles to end, and the answer is... do nothing?

They have the megaphone. If you want to take it away, we have to talk to each other about it so they start marginalizing their posts and opinions. MAGA is the poster child for the Overton shift, it's not going back any amount without effort

groby_b · 2025-12-16T00:32:13 1765845133

You'll need to understand that <blatantly political actor does stupid thing> is a criticism of the actor's stupidity, not the political faction.

If it consistently happens more often for any given political faction, then it's still not an ideological statement, just a realization that not every political direction has an equal commitment to facts and reality.

So, mostly, I'd like the alt-stupids to not take over.

agentifysh · 2025-12-11T22:21:10 1765491670

Looks like they've begun censoring posts at r/Codex and not allowing complaint threads so here is my honest take:

- It is faster which is appreciated but not as fast as Opus 4.5

- I see no changes, or at best very few noticeable improvements, over 5.1

- I do not see any value in exchange for +40% in token costs

All in all I can't help but feel that OpenAI is facing an existential crisis. Gemini 3, even when it's used from AI Studio, offers close to ChatGPT Pro performance for free. Anthropic's Claude Code at $100/month is tough to beat. I am using Codex with the $40 credits but there's been a silent increase in token costs and usage limitations.

AstroBen · 2025-12-11T23:43:31 1765496611

Did you notice much improvement going from Gemini 2.5 to 3? I didn't

I just think they're all struggling to provide real world improvements

chillfox · 2025-12-12T06:40:21 1765521621

Gemini 3 Pro is the first model from Google that I have found usable, and it's very good. It has replaced Claude for me in some cases, but Claude is still my goto for use in coding agents.

(I only access these models via API)

neuah · 2025-12-12T13:40:15 1765546815

Using it in a specialized subfield of neuroscience, Gemini 3 w/ thinking is a huge leap forward in terms of knowledge and intelligence (with minimal hallucinations). I take it that the majority of people on here are software engineers. If you're evaluating it on writing boilerplate code, you probably have to squint to see differences between the (excellent) raw model performances, whereas in more niche edge cases there is more daylight between them.

dominotw · 2025-12-13T13:09:28 1765631368

what specialized use cases did you use it on and what were the outcomes?

can you share your experience and data for "leap forward" ?

dcre · 2025-12-12T00:17:13 1765498633

Nearly everyone else (and every measure) seems to have found 3 a big improvement over 2.5.

agentifysh · 2025-12-12T02:42:23 1765507343

oh yes im noticing significant improvements across the board but mainly having 1,000,000 token context makes a ton of difference, I can keep digging at a problem without compaction.

cmrdporcupine · 2025-12-12T02:14:39 1765505679

I think what they're actually struggling with is costs. And I think they're all behind the scenes quantizing models to manage load here and there, and they're all giving inconsistent results.

I noticed huge improvement from Sonnet 4.5 to Opus 4.5 when it became unthrottled a couple weeks ago. I wasn't going to sign back up with Anthropic but I did. But two weeks in it's already starting to seem to be inconsistent. And when I go back to Sonnet it feels like they did something to lobotomize it.

Meanwhile I can fire up DeepSeek 3.2 or GLM 4.6 for a fraction of the cost and get almost as good results.

XCSme · 2025-12-11T23:53:31 1765497211

Maybe they are just more consistent, which is a bit hard to notice immediately.

dudeinhawaii · 2025-12-12T04:52:01 1765515121

I noticed a quite noticeable improvement to the point where I made it my go-to model for questions. Coding-wise, not so much. As an intelligent model, writing up designs, investigations, general exploration/research tasks, it's top notch.

free652 · 2025-12-12T03:25:52 1765509952

yes, 2.5 just couldn't use tools right. 3.0 is way better at coding. better than sonnet 4.5

enraged_camel · 2025-12-12T01:14:29 1765502069

Gemini 3 was a massive improvement over 2.5, yes.

hmottestad · 2025-12-12T07:27:02 1765524422

I’m curious whether the model has gotten more consistent throughout the full context window. It’s something that OpenAI touted in the release, and I’m curious if it will make a difference for long running tasks or big code reviews.

agentifysh · 2025-12-12T08:36:12 1765528572

one positive is that 5.2 is very good at finding bugs, but not sure about throughput. I'd imagine it might be improved but I haven't had a real task to benchmark it on.

what I am curious about is 5.2-codex but many of us complained about 5.1-codex (it seemed to get tunnel visioned) and I have been using vanilla 5.1

its just getting very tiring to deal with 5 different permutations of 3 completely separate models but perhaps this is the intent and will keep you on a chase.

BrtByte · 2025-12-12T15:44:13 1765554253

The speed bump is nice, but speed alone isn't a compelling upgrade if the qualitative difference isn't obvious in day-to-day use

fellowniusmonk · 2025-12-13T08:55:08 1765616108

5.2 is performing worse in technical reading comprehension for information- and logic-dense puzzles. It's way more confidently wrong and stubborn about understanding definitions of words.

agentifysh · 2025-12-06T22:14:02 1765059242

which Pixel model is best for GrapheneOS? I strongly prefer long battery life.

will other phones be supported? why only pixel?

eks391 · 2025-12-06T23:20:30 1765063230

The OS is not very relevant to the Pixel. Compare the Pixels you like that are new (GrapheneOS drops support as models become older flagships, I think for security reasons) and get that one. IIRC, currently only Pixel is allowed, because the bootloader can be opened without rooting the device.

https://grapheneos.org/faq#device-support

morserer · 2025-12-07T17:42:23 1765129343

Unrelated to bootloader or rooting. Pixels are the only phones that adhere to the device requirements that are listed in the FAQ.

https://grapheneos.org/faq#future-devices

mac-attack · 2025-12-06T22:24:47 1765059887

https://grapheneos.org/faq#recommended-devices

agentifysh · 2025-12-05T19:29:17 1764962957

impressive.....most impressive

its going to reach low 90s very soon if trends continue

agentifysh · 2025-12-05T19:23:37 1764962617

im realizing how much of a bottleneck vision models are

im just a glorified speedreadin' promptin' QA at this point with codex

once it replaces the QA layer its truly over for software dev jobs

future would be a software genie where on aistudio you type: "go make counterstrike 1.6 clone, here is $500, you have two hours"

edit: saw the Screenspot benchmark and holy ** this is an insane jump!!! 11% to 71% even beating Opus 4.5's 50%...chatgpt is at 3.5% and it matches my experience with codex

alex1138 · 2025-12-05T19:25:02 1764962702

> once it replaces the QA layer its truly over for software dev jobs

Maybe. However, with CYA requirements being everywhere in industry, there would have to be 100 waiver forms signed. I-promise-not-to-sue-company-if-AI-deletes-the-entire-database

It won't happen for that reason alone. Oh who am I kidding of course it will