I noticed that Starbucks mobile ordering was down and thought “welp, I guess I’l...

pants2 · 2025-10-29T17:01:52 1761757312

Good thing HN is hosted on a couple servers in a basement. Much more reliable than cloud, it seems!

dang · 2025-10-29T17:57:08 1761760628

Just don't use genetically identical hardware:

https://news.ycombinator.com/item?id=32031639

https://news.ycombinator.com/item?id=32032235

Edit: wow, I can't believe we hadn't put https://news.ycombinator.com/item?id=32031243 in https://news.ycombinator.com/highlights. Fixed now.

hinkley · 2025-10-29T20:11:21 1761768681

I’ve seen this up close twice and I’m surprised it’s only twice. Between March and September one year, 6 people on one team had to get new hard drives in their thinkpads and rebuild their systems. All from the same PO but doled out over the course of a project rampup. That was the first project where the onboarding docs were really really good, since we got a lot of practice in a short period of time.

Long before that, the first raid array anyone set up for my (teams’) usage, arrived from Sun with 2 dead drives out of 10. They RMA’d us 2 more drives and one of those was also DOA. That was a couple years after Sun stopped burning in hardware for cost savings, which maybe wasn’t that much of a savings all things considered.

gogusrl · 2025-10-29T19:41:12 1761766872

I got burnt by this bug on freakin' Christmas Eve 2020 ( https://forum.hddguru.com/viewtopic.php?f=10&t=40766 ). There was some data loss and a lot of lessons learned.

praccu · 2025-10-30T02:27:58 1761791278

Many years ago (13?), I was around when Amazon moved SABLE from RAM to SSDs. A whole rack came from a single batch, and something like 128 disks went out at once.

I was an intern but everyone seemed very stressed.

airstrike · 2025-10-29T17:59:51 1761760791

I love that "Ask HN: What'd you do while HN was down?" was a thing

Cthulhu_ · 2025-10-30T15:03:13 1761836593

My plan B was going to the Stack Exchange homepage for some interesting threads but it got repetitive.

Cthulhu_ · 2025-10-30T15:02:10 1761836530

Man I hit something like that once, a SSD had a firmware bug where it would stop working at an exact number of hours.

lysace · 2025-10-29T17:17:42 1761758262

It was on AWS at least (for a while) in 2022.

https://news.ycombinator.com/item?id=32030400

jjice · 2025-10-29T18:01:09 1761760869

Yeah looks like they're back on M5.

dang saying it's temporary: https://news.ycombinator.com/item?id=32031136

    $ dig news.ycombinator.com

    ; <<>> DiG 9.10.6 <<>> news.ycombinator.com
    ;; global options: +cmd
    ;; Got answer:
    ;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 54819
    ;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 0, ADDITIONAL: 1

    ;; OPT PSEUDOSECTION:
    ; EDNS: version: 0, flags:; udp: 512
    ;; QUESTION SECTION:
    ;news.ycombinator.com.  IN A

    ;; ANSWER SECTION:
    news.ycombinator.com. 1 IN A 209.216.230.207

    ;; Query time: 79 msec
    ;; SERVER: 100.100.100.100#53(100.100.100.100)
    ;; WHEN: Wed Oct 29 13:59:29 EDT 2025
    ;; MSG SIZE  rcvd: 65

And that IP says it's with M5 again.

parliament32 · 2025-10-29T17:07:30 1761757650

Always has been.

Havoc · 2025-10-29T21:24:36 1761773076

The sysadmin subreddit tends to beat hn on outage reports by an hour+ in my experience.

Bunch of on-call peeps over there that definitely know the instant something major goes down

sergiotapia · 2025-10-29T17:05:10 1761757510

Wow I just left a Starbucks drivethru line because it was just not moving. I guess it was because of this.

iso1631 · 2025-10-29T19:40:36 1761766836

You'd think that Starbucks execs would be held accountable for the fragile system they have put in place.

But they won't be.

peanut-walrus · 2025-10-29T21:00:09 1761771609

Why? Starbucks is not providing a critical service. Spending less money and resources and just accepting the risk that occasionally you won't be able to sell coffee for a few hours is a completely valid decision from both management and engineering pov.

iso1631 · 2025-10-30T10:37:47 1761820667

If I were a Starbucks shareholder I wouldn't be happy that my company is throwing away revenue because of the CTO's decision to outsource accountability

Time and time again it's shown that AWS is far more expensive than other solutions, just easier for the Execs to offshore the blame.

bobro · 2025-10-30T05:20:26 1761801626

Or maybe we should throw them in jail.

DaSHacka · 2025-10-30T08:15:06 1761812106

I agree, but because the coffee is crap

munchlax · 2025-10-30T08:19:26 1761812366

And ridiculously expensive

bombcar · 2025-10-30T09:25:03 1761816303

It's absolutely batshit that an in-person transaction with cash becomes impossible when the computers are down.

I've seen it multiple times at various stores; only once did I see them taking cash and writing things down (probably to enter into the system later when it came back up).

hypeatei · 2025-10-29T16:57:23 1761757043

Starbucks mobile was down during the AWS outage too...

SoftTalker · 2025-10-29T17:00:08 1761757208

They are multi-cloud --- vulnerable to all outages!

mring33621 · 2025-10-29T17:13:29 1761758009

you wouldn't believe some of the crap enterprise bigco mgmt put in place for disaster recovery.

they think that they are 'eliminating a single point of failure', but in reality, they end up adding multiple, complicated points of mostly failure.

Hamuko · 2025-10-29T16:58:36 1761757116

Gonna build my application to be multicloud so that it requires multiple cloud platforms to be online at the same time. The RAID 0 of cloud computing.

andoma · 2025-10-29T16:59:48 1761757188

Go multi-cloud they said...

Theodores · 2025-10-29T19:36:43 1761766603

My inner Nelson-from-the-Simpsons wishes I was on your team today, able to flaunt my flask of tea and homemade packed sandwiches. I would tease you by saying 'ha ha!' as your efforts to order coffee with IP packets failed.

I always go everywhere adequately prepared for beverages and food. Thanks to your comment, I have a new reason to do so. Take out coffees are actually far from guaranteed. Payment systems could go down, my bank account could be hacked or maybe the coffee shop could be randomly closed. Heck, I might even have an accident crossing the road. Anything could happen. Hence, my humble flask might not have the top beverage in it but at least it works.

We all design systems with redundancy, backups and whatnot, but few of us apply this thinking to our food and drink. Maybe get a kettle for the office and a backup kettle, in case the first one fails?

01284a7e · 2025-10-29T18:27:01 1761762421

Ha, maybe rethink the I AM NOTHING BUT A HUGE CLOUD CONSUMER thing on some fundamental levels? Like food?

port11 · 2025-10-29T20:09:13 1761768553

I noticed it when my Netatmo rigamajig stopped notifying me of bad indoor air quality. Lovely. Why does it need to go through the cloud if the data is right there in the home network…

pasc1878 · 2025-10-30T14:42:18 1761835338

Same here for netatmo - ironically I replied to an incident report with netatmo saying all was OK when the whole system was falling over.

However netatmo does need to have a server to store data as you need to consolidate acreoss devices plus you can query gfor a year's data and that won't and can't be held locally.

port11 · 2025-10-31T12:21:37 1761913297

It could be local-first. I don't mind the cross-device sync being done centrally, of course, but the app specifically asks for access to Home and Local Network. I wonder if Home Assistant could deal with blackouts…

garbagewoman · 2025-10-30T07:14:25 1761808465

Service culture is so hollow

jeffrallen · 2025-10-29T20:24:24 1761769464

You know you can talk to your barista and ask for a bagel, right? If you're lucky they still take cash... if you still _have_ cash. :)

0_____0 · 2025-10-30T02:23:29 1761791009

I was at a McDs a couple months back and I'm pretty sure you had to use the kiosk to order. Some places are deprecating the cashier entirely.