Stack Overflow is the largest and more popular repository of questions and answers related to the development of software. The data available at that repository may be used to identify topics of interest for developers.
This is an Apache Spark application that uses LDA Algorithm in otder to discover frequent subjects related to Spark at Stack Overflow.