Talk title: AI Alignment (人工智能對齊)

Speaker: Professor Haibing Lu (陸海兵), Santa Clara University, California, USA
Host: Zhihua Zhao (趙志華)
Time: June 20, 9:30-11:30
Tencent Meeting ID: 587-697-123
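Speaker bio: Professor Haibing Lu is on the faculty of Santa Clara University in California, USA. Professor Lu received bachelor's and master's degrees from Xi'an Jiaotong University in 2002 and 2005, respectively, and a Ph.D. in Management (Information Science track) from Rutgers University in 2011, and previously worked as a researcher at Singapore Management University (2005-2006). Professor Lu's research interests lie at the intersection of data mining, privacy and security, and optimization, with multiple papers published in journals and conferences including Manufacturing & Service Operations Management, IEEE Transactions on Big Data, the IEEE International Conference on Data Engineering, and IEEE Transactions on Dependable and Secure Computing.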
Abstract:
As artificial intelligence (AI) systems become increasingly integrated into various aspects of society, ensuring that these systems align with human values, goals, and ethical standards is of paramount importance. This presentation explores the key concepts and principles in AI safety and alignment, a critical field dedicated to developing AI technologies that act in ways consistent with intended ethical guidelines and societal expectations. The presentation will address the technical and ethical challenges associated with aligning AI systems, including value alignment across diverse contexts, mitigating unintended consequences, and addressing conflicts between AI goals and human objectives.
We will examine several cutting-edge methodologies and approaches, such as reinforcement learning from human feedback (RLHF), scalable oversight mechanisms, and mechanistic interpretability. In addition to technical solutions, the presentation will briefly cover recent developments in AI regulation, such as the EU AI Act, highlighting their impact on AI governance and the importance of regulatory frameworks in promoting responsible AI deployment. Participants will have an opportunity to engage with, evaluate, and discuss these concepts.
Organizer: School of Mathematics and Statistics