Artyom Karpov

Inducing human-like biases in moral reasoning LMs

Posted on 2024-02-27

This post presents an inconclusive attempt at a proof of concept showing that fMRI data from human brains can help improve moral reasoning in large language models.

https://www.lesswrong.com/posts/eruHcdS9DmQsgLqd4/inducing-human-like-biases-in-moral-reasoning-lms

# LLM, neuroconnectionist
© 2025 Artyom Karpov