To add a bit more nuance, SO has a question-answer type format, which leads very well into prompt-rely format to train these chat applications. Most of the other sources do not, except for Github issues maybe. Without this question-answer format, there'll be a need for a bigger data labeling effort to train LLMs on new stuff, no?
- Documentation - Open source projects using it - Github issues - Source code - Blogs - Youtube videos
The list goes on