Topic
Researchers behind code-specialized models, datasets, and evaluation setups for software engineering tasks.
Start with Koray Kavukcuoglu, Matteo Grella, and Xiangru Tang for the clearest first pass through code models as the area shows up in practice.
This area overlaps heavily with OpenAI, Google, and Meta. Common institution signals include Northeastern University, Wellesley College, and Boston College. Recurring starting points include the BigCode project and the StarCoder paper ("StarCoder: may the source be with you!").
Snapshot
Researchers: 139
Related labs: 4
Starting points: 8
Developed dossiers: 8
Useful entry points pulled from the strongest linked researcher dossiers.
Large-scale research leadership at Google DeepMind
Via Koray Kavukcuoglu
Original RWKV authorship
Via Matteo Grella
Agentic AI for biomedical discovery
Via Xiangru Tang
Open code LLMs (StarCoder)
Via Brendan Dolan-Gavitt (see the usage sketch after this list)
StarCoder: may the source be with you!
Via Arjun Guha
BigCode (project)
Via Danish Contractor
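The StarCoder and BigCode entries above are easiest to ground with a quick generation example. This is a minimal sketch, assuming the gated bigcode/starcoder checkpoint on the Hugging Face Hub (access requires accepting the BigCode OpenRAIL-M license, and the 15.5B-parameter model needs substantial GPU memory); the prompt and sampling settings are illustrative only.

```python
# Minimal sketch: sampling a completion from StarCoder with Hugging Face
# transformers. Assumes access to the gated bigcode/starcoder checkpoint.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bigcode/starcoder")
model = AutoModelForCausalLM.from_pretrained("bigcode/starcoder")

prompt = "def fibonacci(n: int) -> int:"  # illustrative code prompt
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=48, do_sample=True, temperature=0.2)
print(tokenizer.decode(outputs[0]))
```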
Frequent institutions showing up across profiles in this area include Northeastern University, Wellesley College, and Boston College.
Papers, project pages, and repositories that recur across this part of the field.
BigCode (project)
Linked by 62 profiles in this topic
StarCoder: may the source be with you!
Linked by 62 profiles in this topic
Evaluating Large Language Models Trained on Code
Linked by 36 profiles in this topic
CodeGemma: Open Code Models Based on Gemma
Linked by 20 profiles in this topic
Gemma (docs)
Linked by 20 profiles in this topic
Code Llama: Open Foundation Models for Code
Linked by 17 profiles in this topic
RWKV (project)
Linked by 2 profiles in this topic
RWKV: Reinventing RNNs for the Transformer Era
Linked by 2 profiles in this topic (see the recurrence sketch below)
A stronger first pass through code models, ranked by profile depth, evidence, and editorial importance.
Large-scale training, systems (Koray Kavukcuoglu)
A high-signal figure for understanding how DeepMind turned ambitious research systems into durable products, especially across reinforcement learning, speech, and code generation.
RWKV and efficient sequence modeling (Matteo Grella)
Worth keeping because he is one of the original RWKV coauthors who clearly did not stop there: his public work moves into production AI for crisis intelligence, security-aware infrastructure tooling, and later open-LLM experimentation.
RWKV and efficient sequence modeling (Xiangru Tang)
Worth keeping because it connects an early RWKV byline to a much more visible later research program in agentic AI, biomedical discovery, and code-focused evaluation, which makes the page far more useful than a one-paper ghost profile.
Open code LLMs (StarCoder)
Co-authored StarCoder: a foundational open code model effort (BigCode).
Open code LLMs (StarCoder)
Co-authored StarCoder: a foundational open code model effort (BigCode).
Open code LLMs (StarCoder)
Co-authored StarCoder: a foundational open code model effort (BigCode).
Open code LLMs (StarCoder)
Co-authored StarCoder: a foundational open code model effort (BigCode).
Open code LLMs (StarCoder)
Co-authored StarCoder: a foundational open code model effort (BigCode).
Code-focused LLMs and evaluation (Codex)
Co-authored the Codex evaluation paper, an early anchor for code LLM capability measurement; its pass@k metric is sketched below.
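"Evaluating Large Language Models Trained on Code" is the paper behind HumanEval and the unbiased pass@k estimator, so a small worked sketch shows what "capability measurement" means in practice: generate n samples per problem, count the c that pass the unit tests, and estimate the chance that at least one of k drawn samples passes.

```python
import numpy as np

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator, 1 - C(n-c, k) / C(n, k), in the
    numerically stable product form given in the Codex evaluation paper.
    n = samples per problem, c = samples that passed the unit tests."""
    if n - c < k:
        return 1.0  # every size-k draw must include a passing sample
    return 1.0 - np.prod(1.0 - k / np.arange(n - c + 1, n + 1))

# Worked example: 200 samples with 13 passing gives pass@1 = 0.065,
# while pass@100 is already close to 1.0.
print(pass_at_k(200, 13, 1), pass_at_k(200, 13, 100))
```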
139 linked profiles.