Dominic's Homepage!

Profile Pic

View My GitHub Profile

LinkedIn

My name is Dominic Teo and I'm currently an analytics manager with the Ministry of National Development (MND). I'm interested in the intersection of public policy and technology as well as the application of Big Data and Civic Tech in the public sphere.

I graduated Cum Laude from Sciences Po Paris with a BA in Social Sciences & Economics followed by a MS in Computational Analysis and Public Policy (MSCAPP) at the University of Chicago. The MSCAPP degree is a 2 year dual degree offered by the Schools of Computer Science and Public Policy.

Take a look at my resumé and various projects! (click on titles of projects for more details)

Hosted on GitHub Pages — Theme by orderedlist

Netflix Top 10 - Choosing What To Watch Next (2022)

Built using: Python - Flask and BeautifulSoup, Javascript, HTML and hosted on Render

View the Project on GitHub

Background

Recently, Netflix began publicizing its global top 10 most watched films or TV shows. This provides a list of content for people to watch yet besides the title, Netflix’s website does not provide any more information on these films/shows that will help people decide if these films/shows are for them.

This website/program helps scrapes the top 10 lists from Netflix’s website for the following countries:

Countries covered in this program
Australia
Brazil
France
Hong Kong
India
Japan
Singapore
Taiwan
United States
United Kingdom

Users can either select the country in the drop-down list or access the results directly via URL. There are 2 aspects to this program:

  1. Web Scraping Netflix Top 10 data from Netflix’s website.
  2. Using the Netflix Top 10 data as input for the OMDb API to access the film/show’s details as well as YouTube trailer.

The OMDb API and YouTube trailer will then be shown.

Web Scraping the Netflix Top 10 data from Netflix’s website

Utilising OMDb API to get film/show’s info and web scraping the YouTube trailer

  1. Oftentimes the title scrapped is too long (for titles with “:” e.g. 13 Hours: The Secret Soldiers of Benghazi) and the title in the OMDb database is its simplified version (e.g. 13 Hours) hence I will try to split the scrapped title on “:” and will use the first part of the title as input for the OMDb API again.
  2. For TV shows, just search using the title without the season number.