Read in the webscraped data and identify the top authors by publication count.

Use the master search term file to get the list of topics to which each search term belongs. We will use the higher level topics for plotting in place of specific search terms.

Next, set up a Sankey flow diagram to display authors on the LHS (source) and topics on the RHS (sink). The weight of each connection represents the number of publications.

First Authors By Topic

Institution Publications By Topic