Comments
-
retoor: @jestdotty wow, actually. I'll tell you later how wrong you were after others have given it a try.
-
Considering your whole article on molodetz about how cheap AI is, I assume some comically small amount, like, idk, 29 cents or something.
-
Okay, I went back to your article and then checked GPT's pricing for 4o-mini; now my answer is $4.50 (not changing my original answer, since that would be half cheating and against the spirit of guessing). But that's just for input; you gave no indication of the scale of the output, so there's no way to guess or measure that.
-
retoor: I'm surprised, but it was 20,-. Better guesses than expected! But why am I sharing this? Many people go for self-hosted solutions for LLMs, and so did I at the beginning. But with such pricing, why would you ever run local?
What is the data? All my keyboard input of the last few months. Yes, in keypresses I wrote the equivalent of 300 Harry Potter books.
Another point on self-hosting vs hosted: local LLMs are woke too.
And one more realization: if you pay the 24,- a month for GPT, realize that you're paying for 300 Harry Potter books' worth of input while in reality you won't produce 25% of one book. Better to get API access and use openwebui, not ChatGPT itself.
-
Depends on your data format.
You appear to use English as your primary language, but we don't have enough information to know the languages of the source text, and it's unlikely the source text was generated by you.
So we have to make some big assumptions to answer your question.
Assuming ASCII or UTF-8 text, primarily English, we can assume one byte per character.
We also don't know what format the text data is in, so it may actually use characters outside the ASCII range of UTF-8 (the first 128 code points are encoded as a single byte and correspond exactly to ASCII; everything else takes two to four bytes).
We're looking at 157,440,000 bytes IF our assumptions are correct.
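To make the one-byte-per-character assumption concrete, a quick illustrative check in Python:

    # ASCII text encodes to exactly one byte per character in UTF-8
    print(len("hello world".encode("utf-8")))  # 11 characters -> 11 bytes

    # Characters outside ASCII take 2-4 bytes each
    print(len("café".encode("utf-8")))         # 4 characters -> 5 bytes
    print(len("日本語".encode("utf-8")))         # 3 characters -> 9 bytes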
Assuming o4 mini generates as many characters out as it takes in as input, and because it is a reasoning model (reasoning models tend to saturate their reserved output tokens), any given run should use the entirety of the tokens available for that run (this is a bit of guesswork, but for the purposes of napkin-mathing it, it will be alright).
-
Now, with 157.44 million characters, and an average of 4 characters per token (a simplification due to BPE encoding), we're looking at 39.36 million tokens to process.
39.36 million tokens input, times $0.15 per million tokens, equals $5.904 for the cost of the entire input.
And assuming it generates at least as many tokens out (including reasoning tokens) as it takes in, and setting aside how reasoning tokens are billed separately (which I'm unsure of), at $0.075 per million output tokens you're looking at $2.952 for the output over the entire set, for a grand total of roughly $8.86 to process your entire data set.
Was I close?
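For reference, the estimate above as a tiny script. The byte count, the 4-characters-per-token ratio, and the per-million-token prices are the assumptions stated in the comment, not verified figures from OpenAI's pricing page:

    # Napkin-math cost estimate, using the assumptions from the comment above.
    input_bytes = 157_440_000            # assumed size of the text, ~150 MB
    chars_per_token = 4                  # rough average for BPE tokenizers
    price_in_per_mtok = 0.15             # assumed $ per 1M input tokens
    price_out_per_mtok = 0.075           # assumed $ per 1M output tokens

    tokens = input_bytes / chars_per_token                 # ~39.36M tokens
    cost_in = tokens / 1_000_000 * price_in_per_mtok       # ~$5.90
    cost_out = tokens / 1_000_000 * price_out_per_mtok     # ~$2.95 (assumes output ~= input)

    print(f"input tokens: {tokens / 1e6:.2f}M")
    print(f"input cost:   ${cost_in:.2f}")
    print(f"output cost:  ${cost_out:.2f}")
    print(f"total:        ${cost_in + cost_out:.2f}")      # ~$8.86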
I batched 150 MB of textual data as prompts to OpenAI, to analyse the data and give me facts about it back in JSON format. To give you an idea of how much 150 MB of text is: the first Harry Potter book is 429 KB.
Ok people, how much do you think it cost? I used the gpt-4o-mini model. The result will shock you.
rant
massive prompting
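For anyone wanting to try something similar, a minimal sketch of such a batching loop, assuming the official OpenAI Python SDK; the file name, chunk size, and prompt are illustrative only, not the OP's actual setup:

    # Minimal sketch: chunk a large text file and ask gpt-4o-mini for JSON facts.
    # Assumes the OpenAI Python SDK (pip install openai) and OPENAI_API_KEY set.
    from openai import OpenAI

    client = OpenAI()
    CHUNK_CHARS = 8_000  # illustrative chunk size, well under the context limit

    def chunks(path, size=CHUNK_CHARS):
        # Read the file in fixed-size character chunks.
        with open(path, encoding="utf-8", errors="ignore") as f:
            while True:
                block = f.read(size)
                if not block:
                    break
                yield block

    results = []
    for chunk in chunks("keylog.txt"):  # hypothetical input file
        resp = client.chat.completions.create(
            model="gpt-4o-mini",
            messages=[
                {"role": "system", "content": "Extract notable facts from the text. Reply as JSON."},
                {"role": "user", "content": chunk},
            ],
            response_format={"type": "json_object"},
        )
        results.append(resp.choices[0].message.content)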