More

Someone1234 · 2026-04-26T03:07:12 1777172832

Does the free plan even have access to thinking models?

jychang · 2026-04-26T03:08:27 1777172907

Technically yes, gpt-5.4-mini is available on the free plan

Someone1234 · 2026-04-25T14:56:00 1777128960

I'm not following this. PRs are the first time your reviewers have seen that change, so there is no opportunity downstream to do the things you're suggesting.

You're essentially suggesting pre-PRs, but it is circular, since those same pre-PRs would have the same criticism.

PRs are about isolated core changes, not feature or system design. They answer how not why.

bulbar · 2026-04-25T15:12:37 1777129957

> You're essentially suggesting pre-PRs, but it us circular, since those same pre-PRs would have the same criticism.

Walking this road to the end you get pair programming.

esafak · 2026-04-25T15:25:08 1777130708

You get to design committees where everything has to be approved in advance.

Someone1234 · 2026-04-25T16:38:15 1777135095

Yep, where productivity goes to die and your developers feel no autonomy/trust.

ljm · 2026-04-25T15:31:40 1777131100

Usually by the time a PR has been submitted it's too late to dig into aspects of the change that come from a poor understanding of the task at hand without throwing out the PR and creating rework.

So it's helpful to shift left on that and discuss how you intend to approach the solution. Especially for people who are new to the codebase or unfamiliar with the language and, thanks to AI, show little interest in learning.

Obviously not for every situation, but time can be saved by talking something through before YOLOing a bad PR.

nightpool · 2026-04-25T15:48:48 1777132128

Yes, it should be cheap to throw out any individual PR and rewrite it from scratch. Your first draft of a problem is almost never the one you want to submit anyway. The actual writing of the code should never be the most complicated step in any individual PR. It should always be the time spent thinking about the problem and the solution space. Sometimes you can do a lot of that work before the ticket, if you're very familiar with the codebase and the problem space, but for most novel problems, you're going to need to have your hands on the problem itself to get your most productive understanding of them.

I'm not saying it's not important to discuss how you intend to approach the solution ahead of time, but I am saying a lot about any non-trivial problem you're solving can only be discovered by attempting to solve it. Put another way: the best code I write is always my second draft at any given ticket.

More micromanaging of your team's tickets and plans is not going to save you from team members who "show little interest in learning". The fact that your team is "YOLOing a bad PR" is the fundamental culture issue, and that's not one you can solve by adding more process.

n_e · 2026-04-25T17:36:11 1777138571

I'm not sure what approach you're suggesting?

Asking a more junior developer or someone who "show little interest in learning" to discuss their approach with you before they've spent too much time on the problem, especially if you expect them to take the wrong approach seems like the right way to do things.

Throwing out a PR of someone who doesn't expect it would be quite unpleasant, especially coming from someone more senior.

embedding-shape · 2026-04-25T16:04:37 1777133077

> PRs are the first time your reviewers have seen that change, so there is no opportunity downstream to do the things you're suggesting.

Yes, but I'm arguing for that it shouldn't be the first time they hear about that this change is planned and happening, and their input should have taken into account before the PR is even opened, either upfront/before or early in development. This eliminates so many of the typical PR reviews/comments.

Figure out where you are going before you start going there, instead of trying to course correct after people already walked places.

Someone1234 · 2026-04-25T13:12:42 1777122762

Great site, thanks for the link. But holy heck, that "Also Known As" column is complete chaos. What the heck is wrong with the USB Consortium, do they have brain damage?

Also, according to that table, "USB4 Gen 2×2" is a downgrade on "USB 3.2 Gen 2x2", since the cable length is 0.8m instead of 1m for the same speeds. Which is uhh unexpected.

numpad0 · 2026-04-26T04:07:27 1777176447

It allows manufacturers to clear old stocks of cables by rebranding them as latest products.

USB 1+2/3/4 are basically unrelated standards under the same USB umbrella. USB4 especially is just Thunderbolt/PCIe x4 with features. If Betamax was branded as "VHS 2.0" instead of being a separate standard it would have been felt similar to the USB4 situation.

wpm · 2026-04-25T16:15:08 1777133708

Yeah I what I would give to have been a fly on the wall in the room where they decided to roll with such an obviously terrible and stupid naming scheme. Did anyone protest? Did anyone boldly dissent? Or did they all really just sit around and pat themselves on the back?

lpcvoid · 2026-04-25T21:08:06 1777151286

I really, really wish somebody would explain to me what thr USB consortium was smoking, yeah. I cannot explain it.

BearOso · 2026-04-25T13:52:16 1777125136

The cable length is only for the spec. You can get longer cables that achieve the higher bandwidth, they're just not certified for that.

Someone1234 · 2026-04-25T14:48:50 1777128530

Right, so per spec it is a downgrade.

mort96 · 2026-04-25T14:08:08 1777126088

And? The question stands, why is the USB 4 spec a downgrade?

BearOso · 2026-04-25T14:56:40 1777129000

Probably because with USB 3.2 2x2 they were reviewing too many longer cables that didn't meet the requirements, so they lowered the length so companies didn't submit them only to fail to get certified. It's worth noting that 1.2m is now in the USB4 spec.

Someone1234 · 2026-04-24T16:51:07 1777049467

It feels more and more like OpenAI/Anthoropic aren't the future but Qwen, Kimi, or Deepseek are. You can run them locally, but that isn't really the point, it is about democratization of service providers. You can run any of them on a dozen providers with different trade-offs/offerings OR locally.

They won't ever be SOTA due to money, but "last year's SOTA" when it costs 1/4 or less, may be good enough. More quantity, more flexibility, at lower edge quality. It can make sense. A 7% dumber agent TEAM Vs. a single objectively superior super-agent.

That's the most exciting thing going on in that space. New workflows opening up not due to intelligence improvements but cost improvements for "good enough" intelligence.

2ndorderthought · 2026-04-24T21:32:39 1777066359

You can run local models on junker laptops for specific tasks that are about as good as last years SOTA. If the manufactured compute hardware shortage wasn't happening a lot more people would be running two months ago SOTA locally right now. Funny thoughts...

echelon · 2026-04-24T16:56:39 1777049799

Open Source isn't even within 50% of what the SOTA models are. Benchmarks are toys, real world use is vastly different, and that's where they seriously lag.

Why should anyone waste time on poorer results? I'd rather pay my $200/mo because my time matters. I'm not a poor college student anymore, and I need more return on my time.

I'm not shitting on open weights here - I want open source to win. I just don't see how that's possible.

It's like Photoshop vs. Gimp. Not only is the Gimp UX awful, but it didn't even offer (maybe still doesn't?) full bit depth support. For a hacker with free time, that's fine. But if my primary job function is to transform graphics in exchange for money, I'm paying for the better tool. Gimp is entirely a no-go in a professional setting.

Or it's like Google Docs / Microsoft Office vs. LibreOffice. LibreOffice is still pretty trash compared to the big tools. It's not just that Google and Microsoft have more money, but their products are involved in larger scale feedback loops that refine the product much more quickly.

But with weights it's even worse than bad UX. These open weights models just aren't as smart. They're not getting RLHF'd on real world data. The developers of these open weights models can game benchmarks, but the actual intelligence for real world problems is lacking. And that's unfortunately the part that actually matters.

Again, to be clear: I hate this. I want open. I just don't see how it will ever be able to catch up to full-featured products.

twobitshifter · 2026-04-24T17:26:43 1777051603

Unless you are getting outside of your comfort zone and taking a month off from your $200 subscription, every other month, I can’t see how you can make the universal claim that the open weights models are all 50% as good. Just today, DeepSeek released a new model, so nobody knows how that will compare, a week ago it was Gemma 4, etc. I’m okay with you making a comparison, but state the model and the timeframe in which it was tested that you are basing your conclusions on.

MostlyStable · 2026-04-24T17:02:24 1777050144

I think that there will come a point when open source models are "good enough" for many tasks (they probably already are for some tasks; or at least, some small number of people seem happy with them), but, as you suggest, it will likely always (for the forseeable future at least) be the case that closed SOTA models are significantly ahead of open models, and any task which can still benefit from a smarter model (which will probably always remain some large subset of tasks) will be better done on a closed model.

The trick is going to be recognizing tasks which have some ceiling on what they need and which will therefore eventually be doable by open models, and those which can always be done better if you add a bit more intelligence.

bachmeier · 2026-04-24T17:35:37 1777052137

> Benchmarks are toys, real world use is vastly different...Why should anyone waste time on poorer results? I'd rather pay my $200/mo because my time matters.

This kind of rhetoric is not helpful. If you want to make a point, then make one, but this adds nothing to the conversation. Maybe open source models don't work for you. They work very well for me.

oceanplexian · 2026-04-24T17:38:05 1777052285

> Benchmarks are toys, real world use is vastly different, and that's where they seriously lag.

I'm not disagreeing per-se but if you think the benchmarks are flawed and "my real world usage" is more reflective of model capabilities, why not write some benchmarks of your own?

You stand to make a lot of money and gain a lot of clout in the industry if you've figured out a better way to measure model capability, maybe the frontier labs would hire you.

bandrami · 2026-04-24T17:20:17 1777051217

> Why should anyone waste time on poorer results?

Because in almost no real-world project is "programming time" the limiting factor?

dymk · 2026-04-24T17:57:06 1777053426

No, it's rate at which you can solve problems, and weaker models waste your time because they don't solve problems at the same speed.

hunterpayne · 2026-04-25T01:04:44 1777079084

No, its the number of debug cycles you need to solve said problems. That's the major attribute that controls dev time. And models require far more than I need. You are paying money to take longer and produce worse code. If its different for you, that's a you problem.

bdangubic · 2026-04-25T01:11:41 1777079501

amazing how often is this repeated on here are some sort of a gospel SWEs pass down to one another to continue this charade. I have worked in this industry for 30+ years on countless projects, last decade+ as consultant - at every single project (every single one) programming time was the limiting factor. there is a whole industry inside our industry dealing with “processes” and “how to estimate” (apparently we are incapable of doing that) and whatnot, all because the actual programming time is always a limiting factor and there isn’t an even close 2nd

hypnoce_fr · 2026-04-25T08:46:22 1777106782

What counts as programming time ? Writing ? Reviewing ? Compiling ? Debugging ? It also depends the industry. From idea to production, the limiting factor is not always writing the code, and in my experience (15years in fintech) it almost has never been. Discussion, alignment, compilation, heavy testing pipelines, shipping, all of this on a 30million line monorepo. On a greenfield 10k line repo, yes, AI really shines. In other cases, it’s currently just a helper on very specific narrow tasks, that is not always programming.

manny_rat · 2026-04-25T18:33:02 1777141982

Agreed, it's very strange. I'm sure there are many projects that are like they describe, but it's certainly not all of them. I have worked as a game dev for over 20 years, and probably 75% of that time my team and I have been coding. AI has been an incredible game changer for me over the past 6 months or so (I was using it quite a bit before then, but the capability became much higher lately). I actually have some free time in my days now while still hitting milestone dates, instead of endless crunching.

bandrami · 2026-04-25T02:04:12 1777082652

That's just not my experience. Making the software in the first place is never even the cost center.

lelanthran · 2026-04-24T21:24:36 1777065876

> Open Source isn't even within 50% of what the SOTA models are.

The gap has been shrinking with each release, and the SOTA has already run into diminishing returns for each extra unit of data+computation it uses.

Do you really want to bet that the gap will not eventually be a hairs breadth?

Someone1234 · 2026-04-24T17:07:22 1777050442

> Open Source isn't even within 50% of what the SOTA models are.

When was the last time you used any of them? Because, a lot of people are actively using them for 9-5 work today, I count myself in that group. That opinion feels outdated, like it was formed a year ago+ and held onto. Or based on highly quantized versions and or small non-Thinking models.

Do you really think Qwen3.6 for a specific example is "50%" as good as Opus4.7? Opus4.7 is clearly and objectively better, no debate on that, but the gap isn't anywhere near that wide. I'd call "20%" hyperbole, the true difference is difficult to exactly measure but sub-10% for their top-tier Thinking models is likely.

cwnyth · 2026-04-24T18:36:57 1777055817

Their opinion is also behind on LibreOffice, too. I won't defend GIMP's monstrosity, but I finished a whole dissertation, do all my regular spreadsheet work (that isn't done via R), and have created plenty of visual mockups with LibreOffice. Plus, I don't have to deal with a spammy Windows environment.

Sure, we use Google Drive, too, but that's just for sharing documents across offices, not for everyday use. For that, the open source model is a clear winner in my book.

vlovich123 · 2026-04-24T17:17:47 1777051067

Qwen3.6 at which model size and quantization? I already think Opus 4.6 is usable but still dumb as bricks. A 20% cut off that feels like it would still be unusable. And that's not even getting to the annoyance of setting everything up to run locally & getting HW that can run it locally which basically looks like a Macbook M4 these days as the x86 side is ridiculously pricey to get decent performance out of models.

Someone1234 · 2026-04-24T20:40:14 1777063214

At their highest model size and quant. We are discussing price and quality at the top, not what you can run on the lower end.

So the starting point is Opus 4.7 pricing and we're contrasting alternatives near the top end (offered across multiple providers).

Also I said 20% was hyperbole, meaning far too high.

vlovich123 · 2026-04-24T21:09:01 1777064941

That makes no sense because the largest Qwen models are not even open weight so I’m not sure how that’s any different.

Someone1234 · 2026-04-25T00:45:58 1777077958

Right, which isn't what we're discussing, since I mentioned "across multiple providers" in every comment about this topic.

Those closed weight models aren't available like we're discussing. They're only available from the vendor that created them.

vlovich123 · 2026-04-25T06:04:43 1777097083

The largest qwen model is similar so I’m not sure what point you’re trying to make. The only ones available are the open weight ones which are the smaller variants and nowhere near within 20% of the closed frontier models.

Someone1234 · 2026-04-25T12:55:41 1777121741

The largest open models are within 20%; they're likely within 10%. Go actually try them and stop making outdated assumptions. You don't need to invest a lot of money either, just pick your favorite vendor, and send out a few prompts.

conrs · 2026-04-24T17:54:52 1777053292

IMO It's a different and new model. We're engineers, and we're rich. It's not going to be good enough for us. But the much larger market by far is all the people who used to HAVE to work with engineers. They now have optionality; the pendulum is going to swing.

swader999 · 2026-04-24T18:03:14 1777053794

Also, this space will (and perhaps already is for some of us) be an arms race. Sure you can go local but hosted will always be able to offer more and if you want to be competitive, you'll need to be using the most capable.

nancyminusone · 2026-04-24T18:22:10 1777054930

People pirate photoshop and office if they don't want to pay for it, making it as "free" as GIMP. If there is a free option people will use it. never underestimate the cheapskates.

kube-system · 2026-04-24T17:16:31 1777050991

There's going to be a day when we look back at $200/mo price tags and say "wow that was cheap".

The breakeven at this price is 6 minutes of productivity per work day for an engineer making $200k.

cheschire · 2026-04-24T18:12:57 1777054377

Okay, but then by that logic a person making only $20k would break even at about an hour.

Are you suggesting that someone making $20k should be spending $200/mo on Claude?

kube-system · 2026-04-24T19:02:45 1777057365

I'm talking about the cost of labor.

If you pay someone $20,000 for labor, and they save 65 minutes worth of labor per day using a $200/mo Claude subscription, you are better off buying the Claude subscription.

kuboble · 2026-04-25T05:01:21 1777093281

I think if you (a company) pay someone for labor, your labor cannot use personal subscription and you have to pay considerably higher api prices.

hrimfaxi · 2026-04-25T14:42:25 1777128145

Most companies don't provide a corporate cell phone and have no problems with answering emails from a personal account. Can't have it both ways.

dragandj · 2026-04-25T10:57:41 1777114661

Who's gonna pay $20,000 for labor that can be done by anyone with a $200/mo subscription?

kube-system · 2026-04-25T14:22:15 1777126935

Nobody, but that doesn’t exist yet. Currently these solutions enhance the productivity of workers, but it can’t quite replace them.

echelon · 2026-04-24T18:30:29 1777055429

Everyone is arguing why I'm wrong or that I should have presented more data.

You've got the real insight with this claim.

This is the way the world is moving. Open source isn't even going where the ball is being tossed. There is no leadership here.

You're spot on.

If the cost to deliver a unit of business automation is:

    A. $1M with human labor

    B. $700k human labor + open source models

    C. $500k human labor + $10,000 in claude code max (duration of project)

    D. $250k with humans + $200k claude code "mythos ultra"

The one that will get picked is option "D".

Your poor college students and hobbyists will be on option "B". But this won't be as productive as evidenced by the human labor input costs.

Option "C" will begin to disappear as models/compute get more expensive and capable.

Option "A" will be nonviable. Humans just won't be able to keep up.

Open source strictly depends on models decreasing their capability gap. But I'm not seeing it.

Targeting home hardware is the biggest smell. It's showing that this is non-serious, hobby tinkery and has no real role in business.

For open source to work and not to turn into a toy, the models need to target data center deployment.

hunterpayne · 2026-04-25T01:02:23 1777078943

You are assuming (imagining) a cost relationship which doesn't exist and when researched was the opposite of what you claim.

kube-system · 2026-04-24T19:40:48 1777059648

Yeah, I don't wanna shit on open source, there will certainly be uses for all different kinds of models.

The real money in this market, though, is going to be made in the C suite, and they don't really care about the model. They don't care if it's open source, closed source, or what it is. They don't want to buy a model. They're interested in buying a solution to their problems. They're not going to be afraid of a software price tag -- any number they spend on labor is far more.

Labor is something like 50%+ of the Fortune 500's operating expenses -- capturing any chunk of this is a ridiculous sum of money.

brazukadev · 2026-04-25T13:13:01 1777122781

This is you playing with imaginary numbers, like Sam Altman is doing for a long time. It won't end well.

echelon · 2026-04-25T21:09:08 1777151348

I'm willing to bet that this is the shape of the future.

Wanna bet on it?

brazukadev · 2026-04-25T22:44:58 1777157098

It is not. Yeah I'm betting already. AI is changing software landscape but it won't be captured by openai and anthropic.

kardos · 2026-04-24T19:02:18 1777057338

If sharing all of your code with the closed providers is OK then it works. If that is a blocker, open weights becomes much more compelling...

joquarky · 2026-04-24T19:41:30 1777059690

What will you do when they stop burning cash and the $200 plan becomes $2000?

brazukadev · 2026-04-24T17:17:58 1777051078

> Open Source isn't even within 50% of what the SOTA models are

Who said so? GLM 5.1 is 90% Opus, at least. Some people quite happy with Kimi 2.6 too. I did not try Deepseek 4 yet but also hearing it is as good as Opus. You might be confusing open source models with local models. It is not easy to run a 1.6T model locally, but they are not 50% of SOTA models.

jawilson2 · 2026-04-24T21:22:22 1777065742

I think the problem is that we're all waiting for the patented Silicon Value Rug Pull and ensuing enshittification, where there are a dozen tiers of products, you need 4 of them, and they now cost $2000/month. I want to hedge against that.

Someone1234 · 2026-04-24T16:04:11 1777046651

From my understanding, that isn't how drivers in Linux work. Nearly no kernels will have that code compiled into them because kconfig won't call for it. It is "opt-in", and it is so niche few Distros would have done so.

Linux only ships with a tiny sub-set of the drivers in the source tree.

Someone1234 · 2026-04-24T15:05:37 1777043137

Part of why you're hitting your limit is that Claude's Pro subscription is completely unusable with the current usage limits. I legitimately mean it when I say, you should cancel.

But to the actual question: A lot of people's gut instinct on how to solve this doesn't work. They start going down the road of "well, if I teach the AI about my legacy codebase, it will be smarter, and therefore more efficient." But all you wind up doing is consuming all of your available context, with irrelevancies, and your agent gets dumber and costs more.

What you actually need to do is tackle it the same way a human would: Break it down into smaller problems, where the agent is able to keep the "entire problem" within context at once. Meaning 256K or less (file lengths + prompt + outputs). Then of course use a scratchpad file that holds notes, file references, constraints, and line numbers. That's your compaction protection. Restart the chat with the same scratchpad when you move between minor areas.

Context is your primary-limited resource. Fill it only with what should absolutely need to be there, and nothing else at all.

thinkingtoilet · 2026-04-24T16:02:07 1777046527

Are you managing your scrathpad file or letting the AI do it? Or both?

Someone1234 · 2026-04-24T16:16:13 1777047373

I have the agent automatically manage its own scratchpad file. But it is meant to be fully disposable; it isn't committed, and is destroyed if you shift areas.

Someone1234 · 2026-04-23T18:30:27 1776969027

I'd like to draw people's attention to this section of this page:

https://developers.openai.com/codex/pricing?codex-usage-limi...

Note the Local Messages between 5.3, 5.4, and 5.5. And, yes, I did read the linked article and know they're claiming that 5.5's new efficient should make it break-even with 5.4, but the point stands, tighter limits/higher prices.

puppystench · 2026-04-23T19:03:38 1776971018

For API usage, GPT-5.5 is 2x the price of GPT-5.4, ~4x the price of GPT-5.1, and ~10x the price of Kimi-2.6.

Unfortunately I think the lesson they took from Anthropic is that devs get really reliant and even addicted on coding agents, and they'll happily pay any amount for even small benefits.

kingstnap · 2026-04-23T19:18:49 1776971929

I feel like devs generally spend someone else's money on tokens. Either their employers or OpenAIs when they use a codex subscription.

If I put on my schizo hat. Something they might be doing is increasing the losses on their monthly codex subscriptions, to show that the API has a higher margin than before (the codex account massively in the negative, but the API account now having huge margins).

I've never seen an OpenAI investor pitch deck. But my guess is that API margins is one of the big ones they try to sell people on since Sama talks about it on Twitter.

I would be interested in hearing the insider stuff. Like if this model is genuinely like twice as expensive to serve or something.

vineyardmike · 2026-04-23T20:52:13 1776977533

You can't build a business on per-seat subscriptions when you advertise making workers obsolete. API pricing with sustainable margins are the only way forward if you genuinely think you're going to cause (or accelerate) reduction in clients' headcount.

Additionally, the value generated by the best models with high-thinking and lots of context window is way higher than the cheap and tiny models, so you need to provide a "gateway drug" that lets people experience the best you offer.

CryptoBanker · 2026-04-24T04:29:06 1777004946

> You can't build a business on per-seat subscriptions when you advertise making workers obsolete.

On the other hand I would argue that most workers' salaries are more like subscriptions than API type pricing (which would be more like an hourly contractor)

ewrs · 2026-04-23T19:21:38 1776972098

Yeah and the increase in operating expenses is going to make managers start asking hard questions - this is good. It means eventually there will be budgets put in place - this will force OAI and Anthropic to innovate harder. Then we will see how things pan out. Ultimately a firm is not going to pay rent to these firms if the benefits dont exceed the costs.

dist-epoch · 2026-04-23T20:12:32 1776975152

> Ultimately a firm is not going to pay rent to these firms if the benefits dont exceed the costs.

This is also true for the humans. They will need to provide more benefits than the coding agents cost.

eiksjs · 2026-04-23T20:35:20 1776976520

Humans are needed to use agents and these agents are not showing to be fully autonomous and require constant human review. In fact all you are getting is a splurge of stuff, people not thinking deeper anymore and the creation of more bottle necks and exacerbating the ones that already exist in an org.

You sound like elon with the fsd will be here next year. Many cars have the self driving feature - most drivers don’t use it. Oh why is that I wonder.

mrwaffle · 2026-04-24T04:22:38 1777004558

Meaning that you believe they're not trying their "hardest" to innovate? They must be slacking then.

girvo · 2026-04-23T21:09:03 1776978543

Budgets are already happening

mitjam · 2026-04-23T20:04:35 1776974675

The difference between sub and api price makes it hard to create competitive solutions on the app level.

irthomasthomas · 2026-04-23T20:31:12 1776976272

This was something I worried about after openai started building apps as well as models. Now all of the labs make no secret of the fact that they are going after the whole software industry. Its going to be hard to maintain functioning fair markets unless governments step in.

w10-1 · 2026-04-23T21:11:07 1776978667

Price increases now aim to demonstrate market power for eventual IPO.

If they can show that people will pay a lot for somewhat better performance, it raises the value of any performance lead they can maintain.

If they demonstrate that and high switching costs, their franchise is worth scary amounts of money.

JohnLocke4 · 2026-04-23T19:27:41 1776972461

Sometimes I wonder if innovation in the AI space has stalled and recent progress is just a product of increased compute. Competence is increasing exponentially[1] but I guess it doesn't rule it out completely. I would postulate that a radical architecture shift is needed for the singularity though

[1]https://arxiv.org/html/2503.14499v1 *Source is from March 2025 so make of it what you will.

nomel · 2026-04-23T19:34:09 1776972849

> that devs get really reliant and even addicted on coding agents

An alternative perspective is, devs highly value coding agents, and are willing to pay more because they're so useful. In other words, the market value of this limited resource is being adjusted to be closer to reality.

killingtime74 · 2026-04-23T20:52:30 1776977550

It's not limited though there are alternative providers even now, much less when the price goes up. Chinese providers, European ones, local models.

nomel · 2026-04-23T21:30:53 1776979853

> It's not limited though

Inference is not free, so all providers have a financial limit, and all providers have limited GPU/memory, so there's a physical material limit.

I suggest looking at the profits of these companies (while they scramble to stay competitive).

scotty79 · 2026-04-24T12:37:43 1777034263

We are constantly getting smaller and faster models that are close in performance to state of the art from few months prior. And that's due to architectural inventions. I'm sure it takes some time for these inventions to proliferate to frontier and that some might not be applicable there but we are definitely going faster than just due to compute increase.

It will get faster, but there are no singularities in the real world. Except possibly black holes, but we can't even be sure of that.

pxc · 2026-04-23T19:28:01 1776972481

Maybe that's true. But I think part of the issue is that for a lot of things developers want to do with them now— certainly for most of the things I want to do with them— they're either barely good enough, or not consistently good enough. And the value difference across that quality threshold is immense, even if the quality difference itself isn't.

Mars008 · 2026-04-24T00:54:31 1776992071

> devs get really reliant and even addicted on coding agents

That's more about managers who hope AI will gradually replace stubborn and lazy devs. That will shift the balance to business ideas and connections out of technical side and investments.

Anyway, before singularity there going to be a huge change.

pzo · 2026-04-23T19:21:08 1776972068

On top of that I noticed just right now after updating macos dekstop codex app, I got again by default set speed to 'fast' ('about 1.5x faster with increased plan usage'). They really want you to burn more tokens.

nubg · 2026-04-23T22:53:58 1776984838

wow wait so it wasn't just me leaving it on from an old session?

sounds like criminal fraud to me tbh

0xbadcafebee · 2026-04-23T20:30:51 1776976251

A fool and his money are soon parted

oh_no · 2026-04-23T19:07:13 1776971233

what's the source on that?

puppystench · 2026-04-23T19:09:27 1776971367

In the announcement webpage:

>For API developers, gpt-5.5 will soon be available in the Responses and Chat Completions APIs at $5 per 1M input tokens and $30 per 1M output tokens, with a 1M context window.

oh_no · 2026-04-23T19:19:07 1776971947

oops, thanks. i had just been looking at their api docs

keyle · 2026-04-24T06:07:13 1777010833

I did one review job that sent off three subagents and I blew the second half of my daily limit in 10 mins 13 seconds. Fun times.

raincole · 2026-04-24T01:59:33 1776995973

It's such a vague table for pricing information. 30-150 messages...? What?

Someone1234 · 2026-04-22T19:07:47 1776884867

You've got me curious. Two questions if I may:

- What kind of tasks/work?

- How is either Qwen/Gemma wired up (e.g. which harness/how are they accessed)?

Or to phase another way; what does your workflow/software stack look like?

syntaxing · 2026-04-22T19:23:24 1776885804

1. Qwen is mostly coding related through Opencode. I have been thinking about using pi agent and see if that works better for general use case. The usefulness of *claw has been limited for me. Gemma is through the chat interface with lmstudio. I use it for pretty much everything general purpose. Help me correct my grammar, read documents (lmstudio has a built in RAG tool), and vision capabilities (mentioned below, journal pictures to markdown).

2. Lmstudio on my MacBook mainly. You can turn on an OpenAI API compatible endpoint in the settings. Lmstudio also has a headless server called lms. Personally, I find it way better than Ollama since lmstudio uses llama cpp as the backend. With an OpenAI API compatible endpoint, you can use any tool/agent that supports openAI. Lmstudio/lms is Linux compatible too so you can run it on a strix halo desktop and the like.

ycombinatornews · 2026-04-23T02:53:06 1776912786

Curious how do you run opencode and qwen locally? Few times I tried it responds back with some nonsense. Chat, say, through ollama works well.

syntaxing · 2026-04-23T12:11:12 1776946272

Which quants are you using? I had similar issue until I used Unsloth’s. I would recommend at least UD_6. Also, make sure your context length is above 65K.

https://huggingface.co/unsloth/Qwen3.6-35B-A3B-GGUF

Someone1234 · 2026-04-22T20:45:33 1776890733

Thanks I appreciate the info. I may try to spin up something like this and give it a whirl.

anon373839 · 2026-04-22T22:47:36 1776898056

I would recommend trying oMLX, which is much more performant and efficient than LM Studio. It has block-level KV context caching that makes long chats and agentic/tool calling scenarios MUCH faster.

Someone1234 · 2026-04-20T14:36:29 1776695789

If money is no object, then nothing else is worth considering if it isn't Codex 5.4/Opus 4.7/SOTA. But for many to most people, value Vs. relative quality are huge levers.

Even many people on a Claude subscription aren't choosing or able to choose Opus 4.7 because of those cost/usage pressures. Often using Sonnet or an older opus, because of the value Vs. quality curve.

CamperBob2 · 2026-04-20T15:12:46 1776697966

Cost may or may not be a factor in my choice of model, but knowing the capabilities and knowing they will remain consistent, reliable, and available over time is always a dominant consideration. Lately, Anthropic in particular has not been great at that.

dd8601fn · 2026-04-20T15:11:51 1776697911

Also us weirdos with local model uses. But your point stands.

seplite · 2026-04-20T15:18:26 1776698306

Unfortunately, like with the release of Qwen3.6-Plus, this model also isn’t released for local use. From the linked article: “Qwen3.6-Max-Preview is the hosted proprietary model available via Alibaba Cloud Model Studio”

zozbot234 · 2026-04-20T15:19:39 1776698379

The Max series was never available for local use, though. So this is expected.

dd8601fn · 2026-04-20T18:55:46 1776711346

Sure, not plus or max. I just use their lesser moe ones locally (that would never come close to massive sota models) all the time.

jpfromlondon · 2026-04-20T16:07:28 1776701248

anecdotally the quality of output isn't significantly different, the speed seems to be what you're really paying for, and since the alternative is free I'll stick to local.

paprikanotfound · 2026-04-20T17:13:11 1776705191

What are the best models to run locally?

jpfromlondon · 2026-04-21T08:15:28 1776759328

right now Gemma 4 and Qwen 3.6, I've found the latter to have the slight edge but your results may vary.

elAhmo · 2026-04-20T16:21:42 1776702102

Codex 5.4 is not out?

wahnfrieden · 2026-04-20T14:38:33 1776695913

Codex subscription is very generous at pro tiers

Someone1234 · 2026-04-20T14:28:10 1776695290

With respect, maybe read the article? You're against it, because you didn't read what is being mandated and instead just invented worst-case scenarios instead. You're against your own Strawman.

The proposal is: batteries must be removable using commercially available tools, if the manufacturer requires specialist tools then they must provide them for free.

Essentially they're banning specialized tools, and mandating that repair shops and consumers must be able to purchase replacement batteries for "at least five years."

For context the iPhone was already altered to be compliant with this law and none of the issues you raised were notably worse in the iPhone Air, or 17.

This likely will eliminate specialist software to "sync" batteries, and non-standard screws/attachment mechanisms.

Noumenon72 · 2026-04-20T14:37:15 1776695835

> You're against your own Strawman.

> The proposal is: batteries must be removable using commercially available tools

That's exactly what he's against, plus the premise "Making batteries removable prevents them from being waterproof, dustproof, and collision resistant". Which may be true or false, but not a straw man.

gcanyon · 2026-04-20T21:08:25 1776719305

Thanks, and yes, exactly this. As I acknowledged in my comment, maybe phones can be made waterproof, dustproof, and dropproof while also being user serviceable. If there is a tradeoff, I'll take waterproof over user-battery-replaceable. Apparently conditionals make me a strawman...

Someone1234 · 2026-04-20T14:44:01 1776696241

It absolutely is a Strawman. There's no basis in fact for why using commercial tools instead of specialist tools would result in worse "waterproof, dustproof, or collision resistance." It is completely fictional claim invented whole cloth.

Again, multiple phones have already become compliant with this law and didn't lose or compromise any of those things.

So you OR they, will need to explain the basis for the claim, otherwise it is just a Strawman you're poking baselessly.

Noumenon72 · 2026-04-20T18:38:04 1776710284

I guess the headline is what sets up the straw man -- I didn't think we were arguing about the narrower claim "all phones with replaceable batteries should be removable using commercial tools", just whether they should be replaceable at all. I still think it's reasonable to expect that mandating phones be openable would have tradeoffs in waterproofing, so your disagreement should be factual/historical, not about good faith.