I used Python's email library to extract the files. Because the content is encoded, we cannot apply a regular expression to it directly. I even applied the regular expression to just the To part of the email, but it still gives me junk emails too.
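One possible way around the regex problem is to let the email library decode and parse the To header itself. This is only a minimal sketch: RAW_EML is a made-up sample message and recipient_addresses is a hypothetical helper name, neither comes from the original question. It assumes the message is parsed with policy.default, so each To header exposes its parsed addresses.

```python
from email import message_from_string, policy

# Made-up sample message for illustration only (not from the question).
RAW_EML = (
    "From: Alice <alice@example.com>\n"
    "To: =?utf-8?q?Bob_M=C3=BCller?= <bob@example.com>, carol@example.com\n"
    "Subject: Test\n"
    "\n"
    "Body text mentioning junk@notanaddress.invalid\n"
)


def recipient_addresses(raw):
    """Return the address part of every recipient in the To header(s)."""
    # policy.default decodes encoded words and gives structured header objects.
    msg = message_from_string(raw, policy=policy.default)
    return [
        address.addr_spec
        for header in msg.get_all("To", [])  # empty list if there is no To header
        for address in header.addresses
    ]


print(recipient_addresses(RAW_EML))
```

Since only the parsed To header is read, text in the body that merely looks like an address is never picked up, which may be where the junk matches were coming from.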
I can successfully get the .eml file results in dictionary format. So can’t we just convert the result directly to a data frame and analyze it?
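If each parsed .eml file ends up as one dictionary, something like the following might be enough. It is a rough sketch that assumes pandas is installed; the records list and its keys ("from", "to", "subject") are invented for illustration and are not the actual output from the question.

```python
import pandas as pd

# Hypothetical parsed results, one dictionary per .eml file.
records = [
    {"from": "alice@example.com", "to": "bob@example.com", "subject": "Test"},
    {"from": "carol@example.com", "to": "bob@example.com", "subject": "Re: Test"},
]

# A list of dictionaries maps directly to rows; the keys become the columns.
df = pd.DataFrame(records)

# Example analysis: how many messages each recipient received.
counts = df["to"].value_counts()
print(df)
print(counts)
```

Dictionaries with missing keys still work here; pandas fills the absent cells with NaN.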
Please see the question again. I have updated it with new instructions. I am sure you will be able to do it now.