A machine-readable corpus of Gavin Leech’s research, with one Markdown file per paper.
Each file opens with a YAML frontmatter block (structured metadata a parser can
read without any NLP), followed by an `## Abstract` section and, where available,
a `## Full text` section.

| field | meaning |
|---|---|
| `title` | display title |
| `full_title` | formal title, when it differs from the display title |
| `authors` | ordered list; Gavin Leech appears in position |
| `gleech_role` | e.g. co-first author, editor, sole author (omitted if plain co-author) |
| `year` | publication / release year |
| `venue` | journal, conference, or publisher |
| `type` | one of `book`, `thesis`, `journal`, `conference`, `workshop`, `preprint`, `report` |
| `status` | review/publication status, when not formally published |
| `doi` | DOI, when assigned |
| `arxiv` | arXiv id, when present |
| `url` | canonical landing page |
| `pdf` | direct PDF, when available |
| `code` | code repository |
| `links` | named secondary links (blog, explainer, data, video, poster…) |
| `contribution_hours` | Gavin’s self-reported hours on the work |
| `topics` | free-text topic tags |
| `full_text` | true when the body includes the paper’s full extracted text |
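
As a rough illustration of how a file in this layout might be read, here is a minimal Python sketch that separates the frontmatter from the body and collects the `##` sections. It assumes the frontmatter is delimited by `---` lines, which is the usual convention but is not stated above. The function name `split_paper` and the sample values are hypothetical placeholders, not taken from the corpus.

```python
import re


def split_paper(text):
    """Split one corpus file into (frontmatter_text, sections).

    Assumption: the YAML frontmatter sits between two lines that
    contain only '---', at the very top of the file.
    """
    match = re.match(r"^---[ \t]*\n(.*?)\n---[ \t]*\n(.*)$", text, re.DOTALL)
    if match is None:
        raise ValueError("no YAML frontmatter block found")
    frontmatter, body = match.group(1), match.group(2)

    # Collect the lines under each level-2 heading, e.g. "## Abstract".
    sections = {}
    current = None
    for line in body.splitlines():
        if line.startswith("## "):
            current = line[3:].strip()
            sections[current] = []
        elif current is not None:
            sections[current].append(line)

    joined = {name: "\n".join(lines).strip() for name, lines in sections.items()}
    return frontmatter, joined


# Hypothetical sample file; the values are placeholders.
SAMPLE = """---
title: Example title
year: 2020
type: preprint
full_text: false
---

## Abstract

Placeholder abstract text.
"""

frontmatter, sections = split_paper(SAMPLE)
has_full_text = "Full text" in sections
```

The returned `frontmatter` string can then be handed to any YAML parser to obtain the fields in the table above. The sketch leaves that step out so that it depends only on the standard library.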