In The Stack
The Stack is a 6.4 TB dataset of permissively licensed source code in 358 programming languages.
Five of our repositories are among the 25.74 GB of Go source code (spread across 4,730,461 files) in this dataset.
Transparency is a key factor in the development of datasets and derived models. To check whether your code is included, visit Am I in The Stack?