Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Posts

Aligning AI with Humanity: The Role of Reinforcement Learning in Language Model Alignment

34 minute read

Published: March 01, 2024

In this work, we look into the prominent applications of Reinforcement Learning (RL) in the field of Natural Language Processing (NLP) with a focus on Language Models (LM). First, we examine one of the initial applications of Reinforcement Learning with Human Feedback (RLHF) in NLP. Then, we discuss how this method evolves to be applied in a more general AI and becomes a fundamental aspect of Large Language Model (LLM) training. Also, we discuss the risks, challenges, and potential problems associated with RLHF, offering insights into how these issues might be addressed and mitigated. Furthermore, we explore the emerging field of Reinforcement Learning with AI Feedback (RLAIF), assessing its position in current research. Our investigation shows that RLHF training is a very effective tool for language model alignment. This method cannot only improve the performance of the overall model in NLP benchmarks but also help with problems such as hallucination. In addition, we showed that methods like Constitutional AI can improve the LLMs’ safety by increasing harmlessness while keeping high levels of helpfulness.

portfolio

Portfolio item number 1

Short description of portfolio item number 1

Portfolio item number 2

Short description of portfolio item number 2

publications

Syntax-Guided Transformers: Elevating Compositional Generalization and Grounding in Multimodal Environments

Published in The Conference on Empirical Methods in Natural Language Processing (Genbench Workshop), 2023

In this work, we proposed the syntax guided transformer to improve the compositional generalization in grounding.

Recommended citation: Kamali, Danial, and Parisa Kordjamshidi. "Syntax-Guided Transformers: Elevating Compositional Generalization and Grounding in Multimodal Environments." GenBench: The first workshop on generalisation (benchmarking) in NLP. 2023.
testets https://github.com/hlr/syntax-guided-transformers

Using Persuasive Writing Strategies to Explain and Detect Health Misinformation

Published in 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

In this work we introduce a persuasive strategy detection dataset and show using their labels can improve misinformation detection and explanation.

Recommended citation: Kamali, D., Romain, J., Liu, H., Peng, W., Meng, J., & Kordjamshidi, P. (2023). Using Persuasive Writing Strategies to Explain and Detect Health Misinformation. arXiv preprint arXiv:2211.05985. https://arxiv.org/abs/2211.05985

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.

Danial Kamali

Sitemap

Pages

Page Not Found

Danial's Website

Archive Layout with Content

Posts by Category

Posts by Collection

CV

Markdown

Page not in menu

Page Archive

Portfolio

Publications

Sitemap

Posts by Tags

Talk map

Talks and presentations

Teaching

Terms and Privacy Policy

Blog posts

Jupyter notebook markdown generator

Posts

Aligning AI with Humanity: The Role of Reinforcement Learning in Language Model Alignment

portfolio

Portfolio item number 1

Portfolio item number 2

publications

Syntax-Guided Transformers: Elevating Compositional Generalization and Grounding in Multimodal Environments

Using Persuasive Writing Strategies to Explain and Detect Health Misinformation

talks

Talk 1 on Relevant Topic in Your Field

Tutorial 1 on Relevant Topic in Your Field

Talk 2 on Relevant Topic in Your Field

Conference Proceeding talk 3 on Relevant Topic in Your Field

teaching

Teaching experience 1

Teaching experience 2