Dr. Gerrit Großmann
Cennet Oguz, M.Sc.
Dr. Simon Ostermann
Prof. Dr. Verena Wolf
For any issues regarding the seminar, please e-mail Gerrit Großmann and have [LanguageModelsSeminar2024]
(including the brackets) in the subject line.
waiting list
by emailing us.September 19 (Thursday) and 20 (Friday)
, in person (DFKI, D3.2, room Reuse, next to the main entrance).To pass the seminar, you have to attend all sessions and:
… with a passing grade.
For CS students, the final grade is calculated from the weighted average. Both CoLi and CS students have the opportunity to earn a bonus for submitting an optional practical project. A good project can improve your final grade by 0.3 points (e.g., from 1.7 to 1.3), an excellent project can boost your grade by 0.7 points (e.g., from 2.3 to 1.7).
This is a block seminar in September 2024, which will be held in cooperation between the Department of Multilingual Technologies (MLT) and Neuro-Mechanistic Modeling (NMM).
Large language models (LLMs) have swiftly become a cornerstone in AI research, capturing the attention of the public as the most accessible gateway to artificial intelligence. Despite their groundbreaking impact, LLMs are not without their imperfections. Notably, the occurrence of hallucinations and limited reasoning capabilities, particularly in specialized domains, remain significant challenges.
This seminar begins by investigating the theoretical foundations of language representations, tracing the evolution of transformers and their progression towards the cutting-edge LLMs we see today. Building on this foundation, the seminar will then explore promising future directions. Special emphasis will be placed on the integration of LLMs with neuro-symbolic reasoning and the enrichment of these models through knowledge graphs and other forms of structured data.
We expect no prior knowledge in language modeling. The seminar is open to CS (including related majors) and CoLi students.
TBA
TBA
TBA
| Topic | Student | Paper | | ——| —— | —— |
TBA