6 hours ago
Godot GDscript Code Dataset 5k
This dataset contains GDScript code from 5k+ github repositories. Data from each repo has been extracted into a text file. Each text file contains the code from all .gd files & README.md text (if the README was not empty in the original repo).
Dataset collection date: June 2025
Dataset structure:
Total data: 5,172 (660 MB)
Data format:
Each txt file has the following format:
Download: https://huggingface.co/datasets/wallston...pt-dataset
This dataset contains GDScript code from 5k+ github repositories. Data from each repo has been extracted into a text file. Each text file contains the code from all .gd files & README.md text (if the README was not empty in the original repo).
Dataset collection date: June 2025
Dataset structure:
Code:
files/
├── repo-name-1.txt
├── repo-name-2.txt
├── repo-name-3.txt
├── ....
Total data: 5,172 (660 MB)
Data format:
Each txt file has the following format:
Download: https://huggingface.co/datasets/wallston...pt-dataset