Sitemap
A list of all the posts and pages found on the site. For you robots out there, an XML version is available for digesting as well.
Pages
Posts
portfolio
publications
Investigating information transfer in ECoG time series during visual perception
Published in University of Sussex, 2022
I investigated directed functional connectivity in human electrocorticographic (ECoG) data during visual perception, motivated by predictions from predictive coding.
Identifying a preliminary circuit for predicting gendered pronouns in GPT-2 small
Published in Apart Hackathon on mechanistic interpretability, 2023
We explore the use of automated circuit discovery to extract a circuit predicting gendered pronouns in GPT-2.
Linearly Structured World Representations in Maze-Solving Transformers
Published in Proceedings of UniReps: the First Workshop on Unifying Representations in Neural Models, 2023
We find linear representations in transformers trained to solve mazes.
An information-theoretic study of lying in LLMs
Published in ICML 2024 Workshop on LLMs and Cognition, 2024
We investigate the dynamics of the predictive distribution across the layers of LLMs instructed to lie and tell the truth using information theory and logit lens.
Degeneracies are sticky for SGD
Published in AI Alignment Forum, 2024
Inspired by singular learning theory, we study the behaviour of SGD around degenerate minima in toy loss landscapes.
