Python

Querying the Google Analytics API using OAuth

Today I want to talk about querying the Google Analytics API. In the previous posts, we looked at querying the Twitter and YouTube APIs. With each of these, we didn’t need to jump through the OAuth hoops of misery that are required for the Google, Facebook, LinkedIn, etc. APIs. So today, I wanted to talk about […]

Python

Wild Wednesday: handling semi-structured JSON data

Wild Wednesday posts are all about taming semi-structured or unstructured data. Today, we’re going to look at ingesting JSON data generated by YARN through its API, putting it into a dataframe, and then outputting that information to a Hive table. JSON data can pose problems for us as it has a flexible schema (i.e. not […]
