If there are multiple authors, then that complicates the analysis. Sometimes the author may not be usefully exposed in the HTML, in which case going to the source system makes sense.
If available for export from the CMS, then it can be imported in that manner. Otherwise scraping the information is usually an effective method of getting the author field.