I am hoping someone can advise me on how to extract data from the following dataset: https://ped.uspto.gov/peds/ (the entire dataset is available under the JSON tab). I am trying to work with the 2020 portion of the dataset.
I am beginning a project to obtain patent examiner statistics, specifically how many office actions each examiner issues.
The issue is that the dataset is heavily nested. I would like to first pull each unique patent examiner name and then traverse the dataset by examiner name to calculate statistics from the “Transaction History”.
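To make the idea concrete, here is a minimal sketch of the kind of tally I have in mind. It counts office actions per examiner in a single pass instead of collecting the names first. The field names (`examiner`, `name`, `transactionHistory`, `code`), the sample records, and the set of codes treated as office actions are all assumptions for illustration; I have not confirmed them against the real schema.

```python
from collections import Counter

# Hypothetical record layout; the real field names in the dataset may differ.
sample_records = [
    {
        "examiner": {"name": "SMITH, JOHN"},
        "transactionHistory": [
            {"code": "CTNF", "description": "Non-Final Rejection"},
            {"code": "IDS", "description": "Information Disclosure Statement"},
            {"code": "CTFR", "description": "Final Rejection"},
        ],
    },
    {
        "examiner": {"name": "DOE, JANE"},
        "transactionHistory": [
            {"code": "CTNF", "description": "Non-Final Rejection"},
        ],
    },
    {
        "examiner": {"name": "SMITH, JOHN"},
        "transactionHistory": [
            {"code": "CTNF", "description": "Non-Final Rejection"},
        ],
    },
]

# Assumed set of transaction codes that count as office actions.
OFFICE_ACTION_CODES = {"CTNF", "CTFR"}


def count_office_actions(records):
    """Return a Counter mapping examiner name to number of office actions."""
    counts = Counter()
    for record in records:
        # .get() with defaults tolerates records that lack these keys.
        name = record.get("examiner", {}).get("name")
        if not name:
            continue
        for event in record.get("transactionHistory", []):
            if event.get("code") in OFFICE_ACTION_CODES:
                counts[name] += 1
    return counts


counts = count_office_actions(sample_records)
print(counts["SMITH, JOHN"])  # 3
print(counts["DOE, JANE"])    # 1
```

With the real file, `sample_records` would presumably be replaced by whatever list of application records `json.load` returns, once I work out where that list sits in the structure.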
In short, are there any good methods in Python for traversing a nested JSON dataset? Or should I be tackling this project from a different vantage point?
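For reference, the only generic approach I have come up with so far is a recursive walk that yields every value stored under a given key, however deep it sits. This is just a sketch: the key `examinerName` and the nested sample below are hypothetical, not taken from the actual dataset.

```python
def find_key(node, target_key):
    """Yield every value stored under target_key anywhere in nested dicts/lists."""
    if isinstance(node, dict):
        for key, value in node.items():
            if key == target_key:
                yield value
            # Keep descending, since the key may also appear deeper down.
            yield from find_key(value, target_key)
    elif isinstance(node, list):
        for item in node:
            yield from find_key(item, target_key)
    # Scalars (str, int, None, ...) have nothing to descend into.


# Hypothetical nested structure standing in for the real JSON.
sample = {
    "applications": [
        {"meta": {"details": {"examinerName": "SMITH, JOHN"}}},
        {"meta": {"details": {"examinerName": "DOE, JANE"}}},
        {"meta": {"details": {"examinerName": "SMITH, JOHN"}}},
    ]
}

unique_examiners = sorted(set(find_key(sample, "examinerName")))
print(unique_examiners)  # ['DOE, JANE', 'SMITH, JOHN']
```

This finds the names without my needing to know the exact path, but it loses the link between an examiner and that application's transaction history, which is why I am unsure it is the right vantage point.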