I've worked jobs where I was paid $60-80k and jobs where I was paid $100k+.
I can tell you for sure I cared much less about the job where I wasn't getting paid as much. I would never answer an after-work call or go out of my way to follow up on organizational or systemic issues. For me, working overtime is part of the package when I get paid loads more.
> For me, working overtime is part of the package when I get paid loads more.
I've tended to think this way, but it's all relative. Plenty of people who've never had a $50k job will treat the $100k job as the baseline/minimum, and wouldn't ever think of going the extra mile like you do. For you (and me?) - yeah, they're paying "a lot", so I'll make sure I do a bit extra when needed. But if you've never had less, that $100k is your floor, and you might only consider doing 'extra' for a $200k job.
Looking at how Nvidia's share price has developed over the last 12 months should be reason enough on its own to enter the AI accelerator business. And OpenAI does have software experts, and a big component of Nvidia's success in AI is its software advantage over competitors (the hardware advantage isn't that large). Furthermore, there is Google's "We have no moat, and neither does OpenAI" memo: OpenAI definitely needs to strengthen its moat.
But of course, doing research with PyTorch is not the same as developing driver code for some hardware bus or writing scheduling algorithms.
Per the article, they're apparently not interested in building the chips themselves, but rather in 'getting' TSMC or another manufacturer to build them with the funds.
Which raises more questions than it answers. Why does OpenAI even need to be involved -- is this a common situation?
I would think the manufacturers would anticipate the huge impending demand for chips and raise the necessary investments to expand themselves.
Without any special knowledge, I would assume their motivation is twofold: first, remove the middleman that is potentially adding a huge margin on top of the silicon; second, remove the extra fluff on the silicon that isn't necessary for their specific use case, making it cheaper.
It's really interesting to see these two posts together. I can now imagine AI tools actually inhibiting innovation in many domains, simply because they're optimized for things that are already entrenched and new entrants won't be in the training data. That further inhibits adoption compared to existing tools, and thus further inhibits the growth needed to make it into model updates.
It is a healthy mindset to see this phenomenon as "interesting". I can get there when I dial up my mindfulness, but my default mode here is rather judgy, as in: "please, ppl! pick the better tool as evaluated over a 4+ hour timeframe (after you've got some muscle memory for the API) instead of a 15-minute evaluation".
Forgive me for ranting here, but have people forgotten how to bootstrap their own knowledge about a new library? Taking notes isn't hard. Making a personal cheat-sheet isn't hard. I say all this AND I use LLMs very frequently to help with technical work. But I'm mindful about the tradeoffs. I will not let the tool steer me down a path that isn't suitable.
I'm actually hopeful: there's an unexpected competitive advantage for people who are willing to embrace a little discomfort and take advantage of their neuroplasticity.
> I can now imagine where AI tools actually inhibit innovation [...] new entrants won’t be in the training data
I still imagine the opposite impact... Welcome to no-moats-lang.io! So, you've created yet another new programming language over the holidays? You have a sandbox and LSP server up, and are wondering what to do next? Our open-source LLMs are easily tuned for your wonderful language! They will help you rapidly create excellent documentation, translators from related popular languages, do bulk translation of "batteries" so your soon-to-be-hordes of users can be quickly productive, and create both server and on-prem ChatOverflowPilotBots! Instant support for new language versions, and automatic code update! "LLMs are dynamite for barriers to entry!" - Some LLM Somewhere Probably.
Once upon a time, a tar file with a compiler was the MVP for a language. But with little hope of broad adoption. And year by year, user minimum expectations have grown dauntingly - towards extensive infrastructure, docs, code, community. Now even FAMG struggle to support "Help me do common-thing in current-version?". Looking ahead, not only do LLMs seemingly help drop the cost of meeting those current expectations to something a tiny team might manage, but they also help drop some of the historical barriers to rapid broad adoption - "Waiting for the datascience and webdev books? ... Week after next."
We might finally be escaping decades of language evolution ecosystem dysfunction... just as programming might be moving on from them? :/
Because it’s like willfully choosing the more painful and difficult tool that occasionally stabs you in the hand, because you’re now used to being stabbed in the hand.
Continuing to choose it in the face of - in their own words - a better option, is a bit mind-boggling to me.
I went this route and got taken over by hackers multiple times. It was very worth it. I got taken over because my SSH password was "mars". My little brother and I were sharing the box and wanted an easy password (yeah, we use SSH keys now).
Anyway, we both learnt a lot (htop, tmux, etc.). I'm always jealous that he got to learn everything earlier than me. But if he's not better than me, then I consider myself a failure wrt being an older brother.
The only drawback is that this doesn't work if you want to do AI stuff. For those use cases I rent a machine on Paperspace for a cheap hourly rate.
I'm a little sad I never was. I started with the Linode "hardening Linux" guide, so I had a firewall and disabled SSH passwords from day 1. I still have fun looking at the failed attempts on ports 22 and 443. My server gets so many weird requests, and they used to crash it. A few iterations later, that stopped happening.
Oh, another thing that's worth learning: how to acquire and refresh a Let's Encrypt TLS cert via the ACME protocol. Doing this requires an interesting confluence of skills and tools - you must carve out a vestigial HTTP route in your server, and also configure certbot and cron. And working out the bugs takes a few iterations. (You could install Caddy, but where's the fun in that?!)
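For the curious, here's a minimal sketch of that vestigial route, assuming certbot's webroot mode (the path and port are illustrative, not a recommendation):

    # Minimal ACME HTTP-01 helper: certbot's webroot mode drops a token file
    # under the webroot, and this route just serves it back to the CA.
    # CHALLENGE_DIR is a hypothetical path you'd also pass to certbot's -w flag.
    import http.server
    import os

    CHALLENGE_DIR = "/var/www/letsencrypt"  # hypothetical webroot

    class AcmeHandler(http.server.BaseHTTPRequestHandler):
        def do_GET(self):
            if not self.path.startswith("/.well-known/acme-challenge/"):
                self.send_error(404)
                return
            token = os.path.basename(self.path)  # basename foils path traversal
            try:
                path = os.path.join(CHALLENGE_DIR,
                                    ".well-known/acme-challenge", token)
                with open(path, "rb") as f:
                    body = f.read()
            except FileNotFoundError:
                self.send_error(404)
                return
            self.send_response(200)
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

    # ACME's HTTP-01 challenge arrives on port 80, so this needs root
    # (or CAP_NET_BIND_SERVICE) to bind.
    http.server.HTTPServer(("", 80), AcmeHandler).serve_forever()

After that, a cron entry running "certbot renew" periodically takes care of refreshing the cert before it expires.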
Making it all work, from scratch, made me feel happy in the same way that rebuilding a carburetor or building a bookshelf from scratch seems to make people feel. It's not new, it's not innovative, but it's good. And it's always more interesting than you'd ever suspect.
I thought I had been once: I got a very scary email that came from my own domain, claiming to have gotten into my things, and I fully assumed it was the VPS that got hacked. After calming down and auditing the shit out of everything, I realized it was just plain old domain spoofing. Both disappointing and terrifying at the same time!
There is a lot of value in learning things the hard way vs. the easy way if there is no real significant harm caused, I think. In many cases you learn more, or gain a deeper understanding of and respect for the topic, which is worth something in its own right.
This. It would be suicidal to leave a manager a bad review. The only thing that works is anonymous feedback, which is how Meta/Facebook is so effective at getting rid of bad managers.
Very cool, but will the threads necessarily wake up deterministically? One may wake up before another but not get the CPU before it, correct? (Forgive me if I'm misunderstanding the code.)
Yup. If we're being pedantic, and I'm sure the author of this loves being pedantic, any number of factors could cause this algorithm to be incorrect. Run it on a slow enough CPU, for instance, and it will output the wrong answer.
And correctness is the single most important property of an algorithm. We can solve any problem in constant time if we don't mind the answers being wrong.
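To illustrate (a toy sketch of my own, assuming the code under discussion is the usual sleep-sort-style trick): each thread sleeps in proportion to its value, so the output is only sorted if the scheduler cooperates.

    # Sleep sort: each value gets a thread that sleeps proportionally to the
    # value, then appends to the shared result. "Correct" only if the threads
    # get the CPU in the same order they wake up - which nothing guarantees.
    import threading
    import time

    def sleep_sort(values):
        result = []
        lock = threading.Lock()

        def worker(v):
            time.sleep(v * 0.01)  # wake-up time proportional to the value
            with lock:
                result.append(v)

        threads = [threading.Thread(target=worker, args=(v,)) for v in values]
        for t in threads:
            t.start()
        for t in threads:
            t.join()
        return result

    print(sleep_sort([3, 1, 2]))  # usually [1, 2, 3] - but only usually

time.sleep only promises to sleep *at least* that long, and a loaded machine (or a slow enough CPU) can reorder who actually runs first.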
I'm sure somebody will figure out a way to use multiple seemingly-legitimate parameters to get the same result. Why use ?click_id=aqNERjsdfyqe when you can use ?category=10612550&subcategory=5929127&page=4257344 and transfer the same data without arousing suspicion?
Websites can use a single lengthy encrypted parameter to encode everything (query params and tracking data). And then what? Will they break all website links by removing the parameter?
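To make that concrete, here's a toy sketch (mine - the parameter names are invented) of smuggling an opaque click id through innocent-looking numeric parameters:

    # Chop an opaque token into integers that pass for category/page ids,
    # then rebuild it server-side. Assumes len(token) is a multiple of chunk.
    def encode_token(token: bytes, chunk: int = 4):
        return [int.from_bytes(token[i:i + chunk], "big")
                for i in range(0, len(token), chunk)]

    def decode_token(parts, chunk: int = 4):
        return b"".join(p.to_bytes(chunk, "big") for p in parts)

    token = b"aqNERjsdfyqe"  # the click id from the comment above
    parts = encode_token(token)
    print("?" + "&".join(f"p{i}={n}" for i, n in enumerate(parts)))
    print(decode_token(parts))  # b'aqNERjsdfyqe'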
Not sure if it answers your question, but the paper notes something similar in its discussion of limitations:
> the linear attention of RWKV leads to significant efficiency gains but still, it may also limit the model’s performance on tasks that require recalling minutiae information over very long contexts. This is due to the funneling of information through a single vector representation over many time steps, compared with the full information maintained by the quadratic attention of standard Transformers. In other words, the model’s recurrent architecture inherently limits its ability to “look back” at previous tokens, as opposed to traditional self-attention mechanisms. While learned time decay helps prevent the loss of information, it is mechanistically limited compared to full self-attention.
> Another limitation of this work is the increased importance of prompt engineering in comparison to standard Transformer models. The linear attention mechanism used in RWKV limits the information from the prompt that will be carried over to the model’s continuation. As a result, carefully designed prompts may be even more crucial for the model to perform well on tasks.
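The "single vector representation" point is easy to see in code. Here's a toy sketch of generic linear attention (my own, in numpy - not RWKV's exact formulation, which adds learned time decay): no matter how long the context, a query can only read from two fixed-size accumulators.

    # Linear attention as an RNN: the whole history is compressed into a
    # running (d x d) sum of key-value outer products plus a running key sum.
    import numpy as np

    def linear_attention_step(state, k, v):
        s_kv, s_k = state
        return (s_kv + np.outer(k, v),  # accumulate key-value products
                s_k + k)                # accumulate keys for normalization

    def linear_attention_read(state, q, eps=1e-8):
        s_kv, s_k = state
        return (q @ s_kv) / (q @ s_k + eps)

    d = 8
    rng = np.random.default_rng(0)
    state = (np.zeros((d, d)), np.zeros(d))
    for _ in range(1000):  # a long context...
        state = linear_attention_step(state, rng.random(d), rng.random(d))
    # ...but the read only ever sees the two fixed-size sums in `state`:
    print(linear_attention_read(state, rng.random(d)))

Standard quadratic attention, by contrast, keeps all 1000 keys and values around and can attend to any of them exactly.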