Machine Learning Researcher | AI Safety | Natural Language Processing
I am a Ph.D. candidate at [Your University], working on [Your Research Area]. My research focuses on developing safe and aligned AI systems that can understand and reason about complex human preferences. I'm particularly interested in the intersection of machine learning, cognitive science, and philosophy.
Developing methods to ensure AI systems reliably pursue human-intended goals, even in novel situations. Focus on reward modeling and preference learning.
Building language models that can understand context, reason about implications, and communicate effectively with humans across diverse domains.
Studying how to make AI systems more robust to distribution shift, adversarial examples, and edge cases in real-world deployments.
Designing interfaces and interaction patterns that enable effective collaboration between humans and AI systems in complex tasks.
I'm always interested in discussing new research ideas, collaborations, or opportunities. Feel free to reach out!