6

They keep training bigger language models (GPT et al.). All the researchers appear to be doing this as a first step, and then running a compression step afterwards. One way they do this is to train a smaller network using the bigger network as a teacher (distillation). Another way is to drop some parameters and nodes and test whether the smaller version performs roughly the same, on the theory that some initializations and configurations start out, just by happenstance, efficient (like finding a "winning lottery ticket").

My question is why aren't they running these two procedures *during* training and validation?

If [x] is a good initialization or larger network and [y] is a smaller candidate network, then after each round of training and validation we evaluate a potential [y] against [x]. If the result is acceptable and [y] is a good substitute, [y] becomes [x], and we repeat the entire procedure.

The idea is not to optimize mere training and validation loss, but to bootstrap a sort of meta-loss that spans the whole course of training, amortizing the loss function over the entire run.
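Roughly what I mean, as a toy sketch in PyTorch. The shrink rule (halve the hidden width), the 2% acceptance tolerance, the crude MSE distillation term, and the random data are all illustrative placeholders, not anyone's published procedure:

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    def make_net(hidden):
        return nn.Sequential(nn.Linear(20, hidden), nn.ReLU(), nn.Linear(hidden, 2))

    def run_epoch(model, xb, yb, teacher=None, lr=1e-2):
        # one pass: ordinary cross-entropy, plus a distillation term if a teacher is given
        opt = torch.optim.SGD(model.parameters(), lr=lr)
        opt.zero_grad()
        logits = model(xb)
        loss = F.cross_entropy(logits, yb)
        if teacher is not None:
            with torch.no_grad():
                t_logits = teacher(xb)
            loss = loss + F.mse_loss(logits, t_logits)   # crude distillation term
        loss.backward()
        opt.step()

    @torch.no_grad()
    def val_loss(model, xb, yb):
        return F.cross_entropy(model(xb), yb).item()

    # toy data standing in for the real train/validation sets
    torch.manual_seed(0)
    x_tr, y_tr = torch.randn(256, 20), torch.randint(0, 2, (256,))
    x_va, y_va = torch.randn(64, 20), torch.randint(0, 2, (64,))

    x = make_net(hidden=128)        # [x]: the current (larger) network
    tolerance = 0.02                # accept [y] if it is within 2% of [x]'s validation loss

    for _ in range(5):
        run_epoch(x, x_tr, y_tr)                 # normal training step
        base = val_loss(x, x_va, y_va)           # normal validation

        y = make_net(hidden=max(8, x[0].out_features // 2))   # candidate [y]: half the width
        run_epoch(y, x_tr, y_tr, teacher=x)                   # distill from [x] as teacher
        if val_loss(y, x_va, y_va) <= base * (1 + tolerance):
            x = y                                             # acceptable substitute: [y] becomes [x]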

Anyone seen this in the wild yet?

Comments
  • 3
    Incidentally, I'm surprised someone hasn't added 'tuneable' personality settings, like those demonstrated by TARS in Interstellar.

    If not simple, it should at least be straightforward.

    1. Do parameter-tuning-based training on the model, so it can summarize various aspects of an input or generated output along various numeric ratings: humor, candor, persuasion, wittiness, etc.

    2. When an output is generated in response to an input, run sentiment analysis on it.

    3. Feed the result back in (not like Auto-GPT) and rewrite to move the generated output closer to the desired global personality settings of the script (sketched below).
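
    Roughly, as a sketch of that loop. call_llm, the 0-10 scale, and the prompt wording are placeholders for whatever model or API you'd actually use; step 1's rating ability could come from fine-tuning or just from prompting:

        TARGET = {"formality": 6, "humor": 3, "candor": 8}   # desired global personality settings

        def call_llm(prompt: str) -> str:
            raise NotImplementedError("plug in your model / API call here")

        def rate(text: str, trait: str) -> int:
            # step 2: ask the model (or a separately tuned rater) to score one trait 0-10
            reply = call_llm(f"Rate the following text for {trait} on a 0-10 scale. "
                             f"Answer with a single integer.\n\n{text}")
            return int(reply.strip())

        def apply_personality(draft: str, max_passes: int = 3) -> str:
            # steps 2-3: rate the draft, then rewrite it toward TARGET until it's close enough
            for _ in range(max_passes):
                scores = {trait: rate(draft, trait) for trait in TARGET}
                if all(abs(scores[t] - TARGET[t]) <= 1 for t in TARGET):
                    return draft
                wanted = ", ".join(f"{v}/10 {t}" for t, v in TARGET.items())
                draft = call_llm(f"Rewrite the following text to be {wanted}, "
                                 f"preserving its factual content:\n\n{draft}")
            return draft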
  • 3
    @Wisecrack Modifying LLMs on the fly is uncommon. Personality traits are usually configured with very expensive and slow manual feedback. This is suspected to be part of the reason why Bing turned out a sociopath; Microsoft decided that they didn't have time for alignment.
  • 1
    @lorentz I'm thinking more along the lines of customizing the existing model with some additional training for sentiment analysis.

    You might have it generate an internal reply that then gets fed back in, rated "9/10 in formality", and then fed back into the LLM with part of the prompt being "rewrite the following input to be 6/10 formal, with a 3/10 rating for humor."

    Think of it as a top-k type setting, only done entirely within the prompt instead of within training.

    I might actually try it, considering I have access to GPT-4.
  • 1
    @Wisecrack I see. I worry that if you pipe answers through processing steps with something as coarse as an LLM, the valuable information content will erode quickly.
  • 1
    @lorentz It might be interesting to add a step where you break the potential output into a series of bullet points, just the facts, and only then summarize, before applying sentiment analysis and rewriting.

    A final step might include feeding the output back in as input and trying to get matching bullet points out. If those match the initial bullet points semantically, then you would emit the original rewritten summary, complete with the sentiment transform.

    It's easy to visualize as a behavior-tree API acting as a sort of middleware in front of the core LLM.
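
    A sketch of that round-trip check, reduced to a linear pipeline just to show the data flow. call_llm and the "ask the model whether the fact lists match" step are placeholders, not a real middleware API; a real behavior tree would branch on the failure case instead of just falling back:

        def call_llm(prompt: str) -> str:
            raise NotImplementedError("plug in your model / API call here")

        def extract_facts(text: str) -> list[str]:
            # break the text into terse, just-the-facts bullet points
            reply = call_llm("List the factual claims in the following text as terse "
                             "bullet points, one per line:\n\n" + text)
            return [line.strip("-• ").strip() for line in reply.splitlines() if line.strip()]

        def facts_match(a: list[str], b: list[str]) -> bool:
            # crude semantic check: ask the model whether the two lists say the same things
            reply = call_llm("Do these two lists of claims convey the same facts? "
                             "Answer YES or NO.\n\nList A:\n" + "\n".join(a) +
                             "\n\nList B:\n" + "\n".join(b))
            return reply.strip().upper().startswith("YES")

        def guarded_rewrite(draft: str, style_instruction: str) -> str:
            original_facts = extract_facts(draft)
            rewritten = call_llm(f"{style_instruction}\n\n{draft}")   # sentiment/personality transform
            if facts_match(original_facts, extract_facts(rewritten)):
                return rewritten        # the facts survived the rewrite: release it
            return draft                # otherwise fall back to the plain draft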