Sounds interesting. Anything that makes reading source code easier is to be welcomed, but I'd like to see the paper, some results, and a working tool before judging this particular 'idea'.
To me the abstract sounds as though they are trying to create a tool that automatically discovers the real organization of the code, as opposed to the way it currently happens to be laid out across its source files. In the big application that I work on (>1M lines) I very often see code placed in one source file when it clearly belongs in another, and many modules that hold a miscellaneous collection of functions with no obvious relation to each other.
However, the submission date is 2017-04-01, also known as All Fools' Day.
So could Messrs Amir Saeidi, Jurriaan Hage, Ravi Khadka, and Slinger Jansen please join the conversation (and bring the paper with them)?
Are they trying to apply NLP techniques to software source code? To what end?