Back in June of this year a 3000+ page pdf of Anthony Fauci’s emails were released in an effort to understand the United States government response to the COVID-19 pandemic. (See https://www.buzzfeednews.com/article/nataliebettendorf/fauci-emails-covid-response for the original Buzzfeed article). I had the privilege of collaborating with Austin Benson (Cornell University) and David Gleich (Purdue University) on a recent data analysis project where we (i) extracted, cleaned, and stored the data in an easy-to-use JSON format for future studies, and (ii) analyzed the data using a number of graph, hypergraph, and tensor-based data analysis tools.
Here’s a look at the community structure in a network where edges indicate email correspondence. Most clusters in the network include one or more government agency heads and task leaders, who have high centrality scores in the network.
All of our code and derived datasets are accessible on my github page: https://github.com/nveldt/fauci-email
Our manuscript can be viewed as a data manual, and also includes a number of interesting findings from our initial analysis. https://arxiv.org/abs/2108.01239