Current data is structured as sentence fragments:
"He then studied modern history at Magdalen College, Oxford. Following graduation, Chalk obtained a Graduate Diploma in Law with distinction from the City University London, and qualified as a barrister from the Inns of Court School of Law."
We need to identify the subjects mentioned in each fragment (e.g. Law, Modern History) in order to populate rows in our dataset.
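
One possible starting point is a simple dictionary (gazetteer) lookup, sketched below in Python using only the standard library. This is a minimal sketch under stated assumptions, not a definitive solution: the `SUBJECTS` list and the function names `build_pattern` and `extract_subjects` are hypothetical and would need to be replaced by your own subject taxonomy.

```python
import re

# Hypothetical subject vocabulary (an assumption for illustration);
# a real list would come from your own taxonomy.
SUBJECTS = ["Modern History", "History", "Law", "Mathematics", "Physics"]


def build_pattern(subjects):
    """Compile one case-insensitive regex that matches any known subject."""
    # Longest names first so "Modern History" wins over plain "History".
    ordered = sorted(subjects, key=len, reverse=True)
    alternatives = "|".join(re.escape(s) for s in ordered)
    # \b keeps "Law" from matching inside a longer word such as "lawyer".
    return re.compile(r"\b(?:" + alternatives + r")\b", re.IGNORECASE)


def extract_subjects(text, subjects=SUBJECTS):
    """Return the distinct subjects found in text, in order of first mention."""
    canonical = {s.lower(): s for s in subjects}
    pattern = build_pattern(subjects)
    found = []
    for match in pattern.finditer(text):
        name = canonical[match.group(0).lower()]
        if name not in found:
            found.append(name)
    return found


sample = (
    "He then studied modern history at Magdalen College, Oxford."
    "Following graduation, Chalk obtained a Graduate Diploma in Law with "
    "distinction from the City University London, and qualified as a "
    "barrister from the Inns of Court School of Law."
)
print(extract_subjects(sample))  # ['Modern History', 'Law']
```

Note that this approach only finds subjects already in the list, and it also matches subject words inside institution names (here "Inns of Court School of Law" contributes a second, deduplicated hit for Law). If that matters for your rows, a named-entity or phrase-matching library may be a better fit.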