Yeah mostly regex. Plus some heuristics. Works most of the times. Pretty efficient for the simple amount of heuristics it uses.
You can see the code here https://github.com/legalosLOTR/mdb/blob/master/MDB/DBbuilder...
Check out the function get_movie_name!
Yeah it totally would! Thanks for pointing this out.
Btw, I am planning to use another library that has better detection rates - https://github.com/wackou/guessit