RLLMv3 vs. RLLMv10 vs. RLLMv11 vs. RLLMv12

Mar 15, 2024

This post, although not fully fleshed out, is worth documenting for the subtle differences in behavior it reveals among the four GPT2XL variants I trained with ethical alignment under RLLM. For more information, please refer to the visual map I created.

Read →

Comments

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts

whitehatStoic

RLLMv3 vs. RLLMv10 vs. RLLMv11 vs. RLLMv12