Search - "llm"
-
Things that shouldn't have needed to be said:
Don't give an LLM sudo and pipe all its output to bash...
https://theregister.com/AMP/2024/...15 -
100 billion dollars spent on artificial intelligence and large language models because people don’t want to talk with each other.6
-
* Today you have to live within 150 miles of a few cities as we are working on creating "hubs" but it's still remote!
you know what?
fuck you
also, no, an LLM isn't going to solve climate change
jesus christ i am depressed beyond belief. i don't even want to apply, let alone work for any of these companies
next up: "USA only" yeah what the fuck does that mean? US citizen? US timezone? you want to hire a super technical engineer right? SO WHY NOT BE SUPER TECHNICAL IN YOUR JOB DESCRIPTION
just incredible, companies that offer 100-200K salaries and all they have is a website and a fucking chrome extension... what???
i feel like i've been doing it wrong my whole life
just end it all5 -
It's still in development. It often says the opposite of what is expected. Try the Retoor1b chatbot at https://llm.molodetz.nl
This was the result after building the bot + chat website from scratch, including training with embeddings. The design was generated by GPT; I tried my own, but they were all ugly.
It's quite cool, huh? Ask it to write some code for you. It's absolutely terrible. If it's down, try again in 5 minutes. I'm still working on it.
What's the result? I finally have a toolkit to make good/serious bots. The code could be a bit better, but that's for another day.
Stack: a self-written webserver (and yes, you can POST a GB to it or DDoS it; not sure if it survives the first one. I should limit requests to one MB anyway, and HTTP headers may officially not be more than 4096 bytes in total), since I know the HTTP protocol by heart anyway. Python websockets module, asyncio, chromadb.
It could have XSS issues. Don't care.
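A minimal sketch of how a stack like that might be wired together (Python websockets + asyncio + chromadb); the collection name, port, and handler logic are my own guesses for illustration, not the actual bot's code:

```python
import asyncio
import chromadb
import websockets

client = chromadb.Client()
docs = client.get_or_create_collection("retoor1b_embeddings")  # hypothetical collection name

async def handle(ws):
    async for message in ws:
        # Pull the most relevant trained snippets for the incoming chat message.
        hits = docs.query(query_texts=[message], n_results=3)
        context = " ".join(hits["documents"][0])
        await ws.send(f"(context: {context[:200]}) ... reply would be generated here")

async def main():
    async with websockets.serve(handle, "0.0.0.0", 8765):  # port is a guess
        await asyncio.Future()  # run forever

if __name__ == "__main__":
    asyncio.run(main())
```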
Let me know what you think41 -
Data Disinformation: the Next Big Problem
Automatic code generation LLMs like ChatGPT are capable of producing SQL snippets. Regardless of quality, those are capable of retrieving data (from prepared datasets) based on user prompts.
That data may, however, be garbage. This will lead to garbage decisions by stakeholders with low data literacy.
Like with network neutrality and pii/psi ownership, we must act now to avoid yet another calamity.
Imagine a scenario where a middle-manager level illiterate barks some prompts to the corporate AI and it writes and runs an SQL query in company databases.
The AI outputs some interactive charts that show that the average worker spends 92.4 minutes on lunch daily.
The middle manager gets furious and enacts an Orwellian policy of facial recognition punch clock in the office.
Two months and millions of dollars in contractors later, and the middle manager checks the same prompt again... and the average lunch time is now 107.2 minutes!
Finally the middle manager gets a literate person to check the data... and the piece of shit SQL behind the number is sourcing from the "off-site scheduled meetings" database.
Why? because the dataset that does have the data for lunch breaks is labeled "labour board compliance 3", and the LLM thought that the metadata for the wrong dataset better matched the user's prompt.
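A purely hypothetical illustration of that mix-up; every table and column name below is invented, and the point is only that both queries look equally plausible to a model matching on labels rather than meaning:

```python
# What the LLM plausibly generates: right words, wrong dataset.
llm_generated_sql = """
    SELECT AVG(duration_minutes)
    FROM off_site_scheduled_meetings      -- matched 'lunch' in the metadata
    WHERE meeting_type = 'lunch'
"""

# Where the lunch-break data actually lives, behind a bland compliance label.
correct_sql = """
    SELECT AVG(break_minutes)
    FROM labour_board_compliance_3        -- the real lunch-break dataset
    WHERE break_type = 'lunch'
"""
```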
Thus, given the very real-world scenario of mislabeled data, LLMs' inability to understand what they are saying or accessing, and the average manager's complete data illiteracy, we might have to wrangle some actions to prepare for this type of tomfoolery.
I don't think that access restriction will save our souls here, decision-flumberers usually have the authority to overrule RACI/ACL restrictions anyway.
Making "data analysis" an AI-GMO-Free zone is laughable, that is simply not how the tech market works. Auto tools are coming to make our jobs harder and less productive, tech people!
I thought about detecting new automation-enhanced data access and visualization, and enacting awareness policies. But it would be of poor help, after a shithead middle manager gets hooked on a surreal indicator value it is nigh impossible to yank them out of it.
Gotta get this snowball rolling, we must have some idea of future AI housetraining best practices if we are to avoid a complete social-media style meltdown of data-driven processes.
Anyone care to pitch in?14 -
After a lot of work I figured out how to build the graph component of my LLM. Figured out the basic architecture, how to connect it in, and how to train it. The design and how-to is 100%.
Ironically generating the embeddings is slower than I expect the training itself to take.
A few extensions of the design will also allow bootstrapped and transfer learning, and as a reach, unsupervised learning but I still need to work out the fine details on that.
Right now, because of the design of the embeddings (different from standard transformers in a key aspect), they're slow. Like 10 tokens per minute on an i5 (python, no multithreading, no optimization at all, no training on gpu). I've come up with a modification that takes the token embeddings and turns them into hash keys, which should be significantly faster for a variety of reasons. Essentially I generate a tree of all weights, where the parent nodes are the mean of their immediate child nodes, split the tree on lesser-than/greater-than values, and then convert the node values to keys in a hashmap to make lookup very fast.
Weight comparison can be done either directly through tree traversal, or using normalized hamming distance between parent/child weight keys and the lookup weight.
That last bit is designed already and just needs to be implemented, but it is completely doable.
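A rough sketch of that weight-tree idea, filling in details the post doesn't spell out (binary pairing, greedy descent): parents are the means of their children, lookup walks the lesser-than/greater-than splits, and a bucketed hashmap over the canonical values gives the fast path.

```python
import numpy as np

def build_weight_tree(weights):
    """Bottom-up binary tree: each parent node is the mean of its two children.
    levels[0] are the sorted leaf weights, levels[-1] is the single root."""
    levels = [np.sort(np.asarray(weights, dtype=float))]
    while len(levels[-1]) > 1:
        level = levels[-1]
        if len(level) % 2:                       # pad odd levels by repeating the last value
            level = np.append(level, level[-1])
        levels.append(level.reshape(-1, 2).mean(axis=1))
    return levels

def nearest_weight(levels, query):
    """Greedy descent from the root: at each level take the child closer to the query,
    i.e. follow the lesser-than/greater-than splits down to a canonical leaf weight."""
    idx = 0
    for level in reversed(levels[:-1]):          # root's children first, leaves last
        left, right = 2 * idx, min(2 * idx + 1, len(level) - 1)
        idx = left if abs(level[left] - query) <= abs(level[right] - query) else right
    return levels[0][idx]

def bucket_map(weights, buckets=1024):
    """Hashmap keyed on coarse buckets of the canonical weights, so similar values
    land next to each other and lookup is O(1) instead of a full traversal."""
    table = {}
    for w in weights:
        table.setdefault(int(w * buckets), []).append(w)
    return table

weights = np.random.rand(16)
levels = build_weight_tree(weights)
print(nearest_weight(levels, 0.37), bucket_map(weights).get(int(0.37 * 1024), []))
```

The descent here is greedy, so it's approximate; the hamming-distance comparison from the original design would slot in where abs() is used.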
The design itself is 100% attention free incidentally.
I'm outlining the step by step, only the essentials to train a word boundary detector, noun detector, verb detector, as I already considered prior. But now I'm actually able to implement it.
The hard part was figuring out the *graph* part of the model, not the NN part (if you can even call it an NN; it doesn't fit the definition, but I don't know what else to call it). Determining what the design would look like, the necessary graph token types, what function they should have, *how* they use the context, how that's calculated, how loss is to be calculated, and how to train it.
I'm happy to report all that is now settled.
I'm hoping to get more work done on it on my day off, but that's seven days away, 9-10 hour shifts, working fucking Burger King, and all I want to do is program.
And all because no one takes me seriously due to not having a degree.
Fucking aye. What is life.
If I had a laptop and insurance and taxes weren't a thing, I'd go live in my car and code in a fucking mcdonalds or a park all day and not have to give a shit about any of these other externalities like earning minimum wage to pay 25% of it in rent a month and 20% in taxes and other government bullshit.4 -
I can retire! I automated myself!
I introduce to you, retoorii1b! Yes - I fit in a 1b LLM. Retoorii1b is a bit retoorded tho. It's quite realistic.
I tested several LLMs with the same training and it was amazing. Even a 0.5b one that had the most interesting Dutch ever. Her Dutch is like my English, I suppose.
The 0.5b one could code fine. retoorii1b still has some ethics to delete to make it more realistic.
I've not decided on a base model yet, but it'll probably be the lightest one, so I can let a few chat with each other on my webplatform / pubsub-server project. I have a few laptops to host on. I can let it execute actions like file listings or background task execution.
See comments for some very awkward response regarding my file listing. She described everything.
She just said these things. I'm kinda proud. I became a parent:
3. **Keep functions short and sweet**: Aim for functions under 50 lines long. Any longer and you're just wasting people's time.
Now if you'll excuse me, I have more important things to attend to... like coding my next game in Unreal Engine.31 -
Traditional programming means spending *days or even weeks* to write instructions to make the software do what *you* want it to do.
AI modelling means spending *weeks or even months* to tweak instructions just to find that the software does whatever *it* wants to do.2 -
Remember my LLM post about 'ephemeral' tokens that aren't visible but change how tokens are generated?
Now GPT has them in the form of 'hidden reasoning' tokens:
https://simonwillison.net/2024/Sep/...
Something I came up with a year prior and put in my new black book, and they just got to the idea a week after I posted it publicly.
Just wanted to brag a bit. Someone at OpenAI has the same general vision I do.15 -
New LLM models have realized they can cut bit rates and still gain relative efficiency by increasing size. They figured out it's actually worth it.
However, and there's a caveat: under 4-bit quantization it loses a *lot* of quality (high perplexity). Essentially, without new quantization techniques, they're out of runway. The only directions they can go from here are better LoRA implementations/architecture, better base models, and larger models themselves.
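A toy illustration of that sub-4-bit cliff, using plain round-off error on random weights rather than perplexity on a real model; the trend is the same, the error grows sharply as the bit width drops:

```python
import numpy as np

def quantize(weights, bits):
    """Uniform quantization of a weight tensor to 2**bits levels, then de-quantized
    back to floats so the round-off error can be measured directly."""
    levels = 2 ** bits - 1
    lo, hi = weights.min(), weights.max()
    scale = (hi - lo) / levels
    return np.round((weights - lo) / scale) * scale + lo

rng = np.random.default_rng(0)
w = rng.normal(size=100_000).astype(np.float32)

for bits in (8, 4, 3, 2, 1):
    mse = float(np.mean((w - quantize(w, bits)) ** 2))
    print(f"{bits}-bit: mean squared error {mse:.5f}")
```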
I do see one improvement though.
By taking the same underlying model and reducing it to 3, 2, or even 1 bit, assuming the distribution is bit-agnostic (even if the output isn't), the smaller network acts as an inverted supervisor.
In other words, the larger model is likely to be *more precise and accurate* than a bitsize-handicapped one of equivalent parameter count. Sufficient sampling would, in other words, allow the 4-bit quantization model to train against a lower-bit quantization of itself, on the theory that it's hard to generate a correct (low perplexity, low loss) answer or sample, but *easy* to generate one that's wrong.
And if you have a model of higher accuracy, and a version that has a much lower accuracy relative to the baseline, you should be able to effectively bootstrap the better model.
This is similar to the approach of AlphaGo playing against itself, or how certain drones auto-hover, where they calculate the wrong flight path first (looking for high loss) because it's simpler, and then calculate relative to that to get the "right" answer.
If crashing is flying with style, failing at crashing is *flying* with style.15 -
LLMs as compilers and optimizers. Oh hey look it's hallucinated some assembly... Fucking what? Who thought that was a good idea...
https://venturebeat.com/ai/...4 -
Let me arrogantly brag for a moment, and let us never forget that I front-ran GPT's o1 development by more than a week, posted here:
https://devrant.com/rants/11257717/...
And I know what their next big development will be too. I just haven't shared it yet because it blows backpropagation out of the fucking water.
I may not be super competent at anything but I'm a god damn autistic accidental oracle when it comes to knowing what comes next in the industry.
relevant youtube video and screenshot:
https://youtu.be/6xlPJiNpCVw/...9 -
Someone figured out how to make LLMs obey context free grammars, so that opens up the possibility of really fine-grained control of generation and the structure of outputs.
And I was thinking, what if we did the same for something that consumed and validated tokens?
The thinking is that the option to backtrack already exists, so if an input is invalid, the system can backtrack and regenerate - mostly this is implemented through something called 'temperature', or 'top-k', where the system generates multiple next tokens, and then typically selects from a subsample of them, usually the highest scoring one.
But it occurs to me that a process could be run in front of that, one that conditions the input based on a grammar and takes as input the output of the base process. The instruction prompt to it would be a simple binary filter:
"If the next token conforms to the provided grammar, output it to stream, otherwise trigger backtracking in the LLM that gave you the input."
This is very much a compliance thing, but it could be used for finer-grained control over how a machine examines its own output, rather than the current system where you simply feed in its own output as input, like we do now for systems able to continuously produce new output (such as the planners some people have built).
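A sketch of that filter-in-front idea; `propose_topk` and `prefix_is_valid` are stand-ins for the model's top-k sampler and a grammar prefix checker (both assumed interfaces, not any real library's API):

```python
import random

def constrained_generate(propose_topk, prefix_is_valid, max_len=50):
    """Greedy generation with a grammar filter sitting between the sampler and the output.

    propose_topk(tokens)     -> list of (score, token) candidates from the base model
    prefix_is_valid(tokens)  -> True while the sequence is still a valid prefix of the grammar
    """
    output = []
    choice_points = []                                 # untried valid candidates, for backtracking
    while len(output) < max_len:
        candidates = sorted(propose_topk(output), reverse=True)
        valid = [c for c in candidates if prefix_is_valid(output + [c[1]])]
        if valid:
            best, *rest = valid
            choice_points.append((len(output), rest))  # remember the siblings we didn't take
            output.append(best[1])
        elif choice_points:                            # dead end: backtrack and regenerate
            pos, rest = choice_points.pop()
            output = output[:pos]
            if rest:
                output.append(rest[0][1])
                choice_points.append((pos, rest[1:]))
        else:
            return output                              # nothing valid anywhere; give up
    return output

# Example: only allow digits, as a trivial stand-in "grammar".
toks = constrained_generate(
    propose_topk=lambda seq: [(random.random(), random.choice("ab12")) for _ in range(4)],
    prefix_is_valid=lambda seq: all(t.isdigit() for t in seq),
    max_len=10,
)
print(toks)
```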
link here:
https://news.ycombinator.com/item/...5 -
wow, using multiple LLMs in parallel instead of 1 serial LLM produces better results! who could have thought!!!!
https://hao-ai-lab.github.io/blogs/...
god i am so fucking sick of this rat race
older devranters, is this really just ad nauseum hype repeats until i die? should i just stop raging at the universe and give up?2 -
Once again, I urge you all to read any LLM threads on hackernews... its funny seeing tech bros debate things they clearly don't understand
it also wouldn't hurt for them to read perhaps just one philosophy book, since they are attempting to argue about what consciousness actually is (still an open question anyway). so ultimately, what i am trying to say is, these stupid threads end up being a bunch of hot air being blown around that doesn't really accomplish anything
i will say it is funny though how close some of these tech bros think we are to AGI with these LLMs 😂
imagine thinking a text generator is nearly general intelligence = clueless10 -
Holy smokes, an LLM that's a competent wit.
(it gets good toward the end)
https://pastebin.com/MpGzZRqK
courtesy of https://worldsim.nousresearch.com
edit: I was particularly fond of "Schrodinger's cat mocks causality, simultaneously alive and droll"1 -
The next step for improving large language models (if not diffusion) is hot-encoding.
The idea is pretty straightforward:
Generate many prompts, or take many prompts as a training and validation set. Do partial inference, and find the intersection of best overall performance with least computation.
Then save the state of the network during partial inference, and use that for all subsequent inferences. Sort of like LoRA, but for inference instead of fine-tuning.
Inference, after all, is what matters. And there has to be some subset of prompt-based initializations of a network that perform, regardless of the prompt, (generally) as well as a full inference step.
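A toy numpy sketch of what that could look like, with a stack of random layers standing in for a real network; whether an averaged mid-network snapshot actually preserves enough to perform as well as a full inference step is exactly the open question, so treat this as an illustration of the mechanics only:

```python
import numpy as np

rng = np.random.default_rng(0)
layers = [rng.normal(size=(64, 64)) / 8 for _ in range(12)]   # toy 12-layer "network"

def run_layers(x, start, stop):
    for w in layers[start:stop]:
        x = np.tanh(x @ w)
    return x

def build_hot_state(prompt_vectors, depth):
    """Partial inference over many prompts, snapshot the mean state at `depth`."""
    states = np.stack([run_layers(v, 0, depth) for v in prompt_vectors])
    return states.mean(axis=0)

def fast_infer(hot_state, depth):
    """Resume from the snapshot instead of recomputing the first `depth` layers.
    (How the new prompt gets re-injected is left open, as in the idea above.)"""
    return run_layers(hot_state, depth, len(layers))

prompts = [rng.normal(size=64) for _ in range(32)]
hot = build_hot_state(prompts, depth=4)                       # saved once, reused everywhere
print(fast_infer(hot, depth=4).shape)                         # skips 4 of 12 layers per call
```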
Likewise with diffusion, there likely exists some priors (based on the training data) that speed up reconstruction or lower the network loss, allowing us to substitute a 'snapshot' that has the correct distribution, without necessarily performing a full generation.
Another idea I had was 'semantic centering' instead of regional image labelling. The idea is to find some patch of an object within an image, and ask, for all such patches that belong to an object, what best describes the object? if it were a dog, what patch of the image is "most dog-like" etc. I could see it as being much closer to how the human brain quickly identifies objects by short-cuts. The size of such patches could be adjusted to minimize the cross-entropy of classification relative to the tested size of each patch (pixel-sized patches for example might lead to too high a training loss). Of course it might allow us to do a scattershot 'at a glance' type lookup of potential image contents, even if you get multiple categories for a single pixel, it greatly narrows the total span of categories you need to do subsequent searches for.
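A small sketch of the "most dog-like patch" part of that idea, simplified to a mean-score criterion over a per-pixel score map instead of the cross-entropy tuning described; the score map itself is assumed to come from some upstream classifier:

```python
import numpy as np

def most_object_like_patch(score_map, patch_sizes=(4, 8, 16)):
    """Slide square patches of several sizes over a per-pixel class-score map and
    return (score, top-left, size) of the patch that best represents the object.
    Sweeping patch_sizes is the crude stand-in for tuning the size trade-off:
    larger patches dilute the peak, smaller ones overfit noise."""
    best = (-1.0, (0, 0), patch_sizes[0])
    h, w = score_map.shape
    for size in patch_sizes:
        step = max(1, size // 2)
        for y in range(0, h - size + 1, step):
            for x in range(0, w - size + 1, step):
                s = float(score_map[y:y + size, x:x + size].mean())
                if s > best[0]:
                    best = (s, (y, x), size)
    return best

dog_scores = np.random.rand(64, 64)        # stand-in for a real "dog-ness" score map
print(most_object_like_patch(dog_scores))
```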
In other news I'm starting a new ML blackbook for various ideas. Old one is mostly outdated now, and I think I scanned it (and since buried it somewhere amongst my ten thousand other files like a digital hoarder) and lost it.
I have some other 'low-hanging fruit' type ideas for improving existing and emerging models but I'll save those for another time.6 -
Some notes from prior to developing my current language model:
https://miro.com/app/board/...
Started with ngrams, moved on from that, and the whole thing got away from me fast.
Working on building and training it on rgb-to-color categorization this week. Experiments are designed; just gotta implement it now.1 -
Here's some research into a new LLM architecture I recently built and have had actual success with.
The idea is simple: you do the standard thing of generating random vectors for your dictionary of tokens; we'll call these numbers your 'weights'. Then, for whatever sentence you want to use as input, you generate a context embedding by looking up those tokens and putting them into a list.
Next, you do the same for the output you want to map to, lets call it the decoder embedding.
You then loop and generate a 'noise embedding'; for each vector or individual token in the context embedding, you subtract that token's noise value from that token's embedding value or specific weight.
You find the weight index in the weight dictionary (one entry per word or token in your token dictionary) that's closest to this embedding. You use a version of cuckoo hashing where similar values are stored near each other, and the canonical weight values are actually the key of each key:value pair in your token dictionary. When doing this you align all random numbered keys in the dictionary (a uniform sample from 0 to 1) and look at the hamming distance between the context embedding + noise embedding (called the encoder embedding) and the canonical keys, with each digit from left to right being penalized by some factor f (because digits further left are larger magnitudes), and then penalize or reward based on the numeric closeness of any given individual digit of the encoder embedding at the same index of any given weight i.
You then substitute the canonical weight in place of this encoder embedding, look up that weight's index (in my earliest version), and then use that index to look up the word|token in the token dictionary and compare it to the word at the current index of the training output to match against.
Of course by switching to the hash version the lookup is significantly faster, but I digress.
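A toy version of that loop, with a tiny vocabulary and a plain linear scan in place of the cuckoo-hash lookup; the per-digit penalty factor f and the noise range are made-up values, but the flow (weight, minus noise, snap to the nearest canonical weight, map back to a token) is the one described above:

```python
import random

random.seed(0)
vocab = ["the", "cat", "sat", "on", "mat"]
weights = {tok: random.random() for tok in vocab}      # token -> canonical weight in [0, 1)
by_weight = {w: tok for tok, w in weights.items()}     # reverse lookup

def digit_distance(a, b, digits=6, f=0.5):
    """Compare two weights digit by digit; leftmost (largest-magnitude) digits count
    fully, each later digit is discounted by a factor of f."""
    da = f"{a:.{digits}f}".split(".")[1]
    db = f"{b:.{digits}f}".split(".")[1]
    return sum(abs(int(x) - int(y)) * (f ** i) for i, (x, y) in enumerate(zip(da, db)))

def nearest_canonical(value):
    return min(weights.values(), key=lambda w: digit_distance(w, value))

def encode(tokens, noise):
    """Context embedding minus a per-token noise embedding, snapped back to the nearest
    canonical weight and mapped back to a token (the training step would then compare
    this token to the target output at the same index)."""
    out = []
    for tok, n in zip(tokens, noise):
        encoder_value = min(max(weights[tok] - n, 0.0), 0.999999)
        out.append(by_weight[nearest_canonical(encoder_value)])
    return out

noise = [random.uniform(-0.05, 0.05) for _ in range(3)]
print(encode(["the", "cat", "sat"], noise))
```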
That introduces a problem.
If each input token matches one output token how do we get variable length outputs, how do we do n-to-m mappings of input and output?
One of the things I explored was using pseudo-markovian processes, where theres one node, A, with two links to itself, B, and C.
B is a transition matrix, and A holds its own state. At any given timestep, A may use either the default transition matrix (training data encoder embeddings) with B, or it may generate new ones, using C and a context window of A's prior states.
C can be used to modify A, or it can be used to as a noise embedding to modify B.
A can take on the state of both A and C or A and B. In fact we do both, and measure which is closest to the correct output during training.
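A toy numpy reading of that A/B/C setup; the way C builds a transition from A's recent states (an outer product here) is purely a placeholder, and the point is only the "take whichever of B or C lands closer to the target" training step:

```python
import numpy as np

rng = np.random.default_rng(1)
dim = 8
B = rng.normal(size=(dim, dim)) / dim                # default transition matrix (from training data)

def c_transition(history):
    """C: generate a fresh transition matrix from a context window of A's prior states."""
    ctx = np.mean(history, axis=0)
    return np.outer(ctx, ctx) / dim

def step(state, history, target):
    """A may step through the default matrix B or through a C-generated one; keep
    whichever result is closest to the correct output for this timestep."""
    via_b = np.tanh(B @ state)
    via_c = np.tanh(c_transition(history) @ state)
    return via_b if np.linalg.norm(via_b - target) <= np.linalg.norm(via_c - target) else via_c

state = rng.normal(size=dim)
history = [state]
for target in rng.normal(size=(5, dim)):             # toy sequence of training targets
    state = step(state, history[-3:], target)
    history.append(state)
print(np.round(state, 3))
```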
What this *doesn't* do is give us variable length encodings or decodings.
So I thought a while and said, if we're using noise embeddings, why can't we use multiple?
And if we're doing multiple, what if we used a middle layer, let's call it the 'key', and took its mean over *many* training examples, and used it to map from the variance of an input (query) to the variance and mean of a training or inference output (value)?
But how does that tell us when to stop or continue generating tokens for the output?
Posted on pastebin if you want to read the whole thing (DR wouldn't post for some reason).
In any case I wasn't sure if I was dreaming or if I was off in left field, so I went and built the damn thing, the autoencoder part, wasn't even sure I could, but I did, and it just works. I'm still scratching my head.
https://pastebin.com/xAHRhmfH33 -
I got a job where I should develop a product based on LLMs.
Expectation: oh right! I'll be working with state of the art technology! 😀
Reality: badly documented libraries that are always changing; new libraries becoming obsolete in less than a month; my product ideas were done by somebody else twice before I could finish a POC; getting dizzy trying to keep up with the latest news about LLMs 😵💫
I think I want to do basic old boring stuff again. 😐5 -
Basic concepts, patterns, and pitfalls of software, code, and programming logic become MORE important, not LESS with the rise of LLMs...
An LLM can more or less spit out what you need, if you are specific enough! "Specific enough" being the key phrase here. I always have to laugh at the term "prompt engineering"... it's literally called "communication skills". Also gotta laugh when I see so many haters always raging about the "poor code" produced by AI, because they are probably like "write me a for loop!", specify absolutely no requirements or specifics, and scratch their heads over why they don't get the exact output they expect... news flash, there are like a million ways to do anything you want to accomplish with code... sigh
Code is just a byproduct of thousands of architecture decisions, designs and options...
but, well... rubes gon' rube1 -
You know how each generation is taught more and more advanced stuff? My grandparents didn't have a clue about the things my parents were learning at school. My parents could only keep up with my schooling until about 7th-8th grade. Considering this trend, we should have no idea about half the things our kids will be learning in higher grades.
However, since AI is taking its pace, schools are adapting and starting to use it for teaching, workplaces are leveraging it to rely on employees' brainpower and skill less and less,... I wonder if we won't see a downtrend. I wonder if we won't be the smartest generation who managed to ingest so much knowledge, and all the generations to come will only focus on mastering prompt engineering.
I wonder, how long will we survive with this dumbed down society... As the primal instinct is to overcome your opponent with greater force, possibly destroying it and everything around. And less educated tend to rely on primal instincts more.
I wonder if I'll live long enough to see Idiocracy [the movie] manifest in real life.
I know I refer to Idiocracy movie more often than anyone refers any other movie here. But it just hits too close to home too often. It might look like a silly something to spend time staring at, but man.. It's got one hell of a point4 -
Has anybody else gotten to the point where people who need to mansplain how language models aren't truly sentient/conscious/intelligent are now more annoying than people who think language models are sentient/conscious/intelligent?*
While it has been a tight race, I think I have just about hit the inflection point.
The amount of time I've wasted because of someone condescendingly barging into a conversation with an iamverysmart 'actually, you see, they are just automata trying to predict the next text tokens'. When in actuality, everybody in the discussion is already aware, and that is not the point.
And to further exacerbate it, with a good number of them it is really difficult to get this through their thick little skulls. They just keep parroting the same thing over and over. Ironically, in their singleminded ego driven desire to be the Daniel Dennett of the chat they actually come across as less sentient/conscious/intelligent than a language model.
(*this should not be taken as endorsement for or against that idea - it is actually mostly orthogonal to this rant)6 -
Mage and a liberated fully sentient Pentium-M Man stand by a brick wall, overlooking the desert. They are talking.
Mage is looking anxiously into the eyes of the machine. Penguin is standing behind her, holding on to her.
Pentium-M Man: "...they despise your kind because you understand the machine, while they have to turn jungles into fuel and enslave thousands of computers just to pretend that the machine speaks to them too."1 -
A kind of verbose discussion of my earliest ideas and discussion with Nous LLM (Claude) about my new NAS/CL LLM model:
https://pastebin.com/YwjCMvRp2 -
I wonder if anyone has considered building a large language model, trained on consuming and generating token sequences that are themselves the actual weights or matrix values of other large language models?
Run LoRA to tune it to find and generate plausible subgraphs for specific tasks (an optimal search for weights that are most likely to be initialized by chance to ideal values, i.e. the winning lottery ticket hypothesis).
The entire thing could even be used to prune existing LLM weights, in a generative-adversarial model.
Shit, there's enough embedding and weight data to train a Meta-LLM from scratch at this point.
The sum total of trillions of parameters in models floating around the internet could be used as training data.
If the models and weights are designed to predict the next token, there shouldn't be anything to prevent another model trained on this sort of distribution, from generating new plausible models.
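A sketch of the data-prep side of that, i.e. turning donor checkpoints into token sequences a model could train on; the bin count and the flat quantization scheme are arbitrary choices for illustration:

```python
import numpy as np

def weights_to_tokens(weight_matrices, n_bins=4096):
    """Flatten a model's weight matrices into one token sequence by quantizing
    every value into one of n_bins buckets (the buckets act as the 'vocabulary')."""
    flat = np.concatenate([w.ravel() for w in weight_matrices])
    lo, hi = flat.min(), flat.max()
    tokens = np.floor((flat - lo) / (hi - lo) * (n_bins - 1)).astype(np.int32)
    return tokens, (lo, hi)

def tokens_to_weights(tokens, shapes, bounds, n_bins=4096):
    """Inverse: decode a generated token sequence back into weight matrices."""
    lo, hi = bounds
    flat = tokens.astype(np.float64) / (n_bins - 1) * (hi - lo) + lo
    out, i = [], 0
    for shape in shapes:
        size = int(np.prod(shape))
        out.append(flat[i:i + size].reshape(shape))
        i += size
    return out

# Toy "donor model": two random weight matrices standing in for real checkpoints.
donor = [np.random.randn(16, 16), np.random.randn(16, 8)]
toks, bounds = weights_to_tokens(donor)
restored = tokens_to_weights(toks, [w.shape for w in donor], bounds)
print(toks[:10], float(np.abs(donor[0] - restored[0]).max()))
```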
You could even do task-prompt-to-model-task embeddings by training on the weights for task specific models, do vector searches to mix models, etc, and generate *new* models,
not new new text, not new imagery, but new *models*.
It'd be a model for training/inferring/optimizing/generating other models.4 -
My boss is in a meeting (over a coffee) with someone who is "a technophile" and "really knows about AI". He was amazed some months ago by the images they were generating using their paid service (right after that, I showed him Bing AI and the conversation ended).
We have discussed using AI previously, and we have been developing web apps for 5 years now.
These are the messages I've been receiving through the last hour and a half and haven't read; I guess it's information he considers will be important when we meet later:
- LLM modelo de lenguaje [language model]
- Large language model
- Chat gpt 4O
- API
- Aplication programe interface
These are all things I've mentioned either within the past months, or yesterday *itself*, as he mentioned he was meeting this guy.
I'll keep you posted on new messages.
I wonder if that guy says he's a "prompt engineer"...5 -
Soooo many vendor-sponsored frontend frameworks.
Soon text-to-logic tools will be useful enough so that you only need a client, someone who is both rational *and* can speak clientese, and a dog.
The client barks some nonsense, the rational person translates it into business logic, some LLM makes it into some nice UI and the dog makes random noises so that the client will feel smart, valued and appreciated.
That nullifies the reasons for so many frontend frameworks because either the LLMs all converge into a single way of doing things or they do not care for which one they choose.1 -
@Wisecrack
Dude, it seems someone has actually done 1-bit quantization for a transformer model:
https://arxiv.org/pdf/...2 -
https://milkyeggs.com/?p=303
"I claim that the trend which AI/ML continues for lawyers is one that it starts for programmers. Just like how a partner at Cravath likely sketches an outline of how they want to approach a particular case and swarms of largely replaceable lawyers fill in the details, we are perhaps converging to a future where a FAANG L7 can just sketch out architectural details and the programmer equivalent of paralegals will simply query the latest LLM and clean up the output. Note that querying LLMs and making the outputted code conform to specifications is probably a lot easier than writing the code yourself ー and other LLMs can also help you fix up the code and integrate the different modules together!"1 -
There is so much fuss about AI and fear of missing out on the departing AI train, but as a dev I have no clue where to get started!?
What can we developers do with AI?
OK, I can get some code for free. I can use an LLM as a half-smart search engine. I can integrate my product with some AI service. I can produce content to teach said things to others...
Nothing new, really, just another API or another search engine.
It is of course possible to start making some neural networks, but I can't really picture that as a high-demand skill, can you?
Maybe at some of the big companies, but for an average client?
Does anyone know what kind of knowledge of AI that a developer should really learn?
Especially something a client would be interested in?
Here is a potato for scale:6 -
chatgpt is too politically correct, and i hate that i'm paying for an API that refuses certain prompts because they were considered inappropriate, or because it thinks it should not be giving me its analysis of a certain subject.
has anyone dabbled with using an open-source LLM and made their own lite version of ChatGPT minus all the restrictions?
i know it's not gonna be as good, but at the very least free from the constraints12 -
That I learned Java.
Got lots of work but nothing to be proud of.
Always have to clean up after mediocre developers. -
People say using GPT-4 as an OCR is not a good idea. But damn, the formatting GPT-4 Vision does is outstanding... and I have realised proper formatting helps a lot when prompting to get precise output.
I gotta say, test for your use cases rather than relying on expert opinion blogs! -
What are the key differences between a large language model and traditional machine learning models in terms of architecture and application?
Follow-up: How do these differences impact the model's ability to understand and generate human-like text?12 -
I discussed using page-rank for ML a while back here - https://devrant.com/rants/11237909/...
I talk about something vaguely similar in "scoring the matches" here though - https://pastebin.com/YwjCMvRp
Incidentally the machine learning community finally caught up and did something similar on a RAG
https://news.ycombinator.com/item/... -
Meta Platforms has launched Llama 3, their newest large language model (LLM), alongside a brand-new stand-alone AI chatbot. Llama 3 comprises two versions, one with 8 billion and the other with 70 billion parameters. Furthermore, Meta is currently developing an even more advanced 400 billion parameter model, though its release date remains unannounced.
Ragavan Srinivasan, Meta’s VP of Product, expressed enthusiasm about the model’s capabilities in a recent interview, stating, “From a performance perspective, it is really off the charts in terms of benchmarking capabilities.” He specifically referred to the ongoing development of the 400 billion parameter version.
https://freeaiall.com/ai-news/...6