YTMED project

Description

This project is to measure the performance of more than 800 Youtube channels in different categories. I would like to answer the following questions:

  1. Who are the most viewed musicians on YouTube?
  2. What are the best channels to learn English/German?
  3. On what factors does the success of a YouTube channel depend?
  4. What is the most watched news channel?
  5. What are the most viral videos?

The list goes on and on, but let's stop here for now.

Data Sources

Music channels

Origin of the list of artists.

  1. The Official Big Top 40 (UK)
  2. https://www.bigtop40.com/

  3. Spotify
  4. https://open.spotify.com/

  5. Pitchfork Media (USA)
  6. https://pitchfork.com/

  7. Rolling Stone Magazine (USA)
  8. https://www.rollingstone.com/

  9. The Official Chart (UK)
  10. https://www.officialcharts.com/

  11. The Guardian (UK)
  12. https://www.theguardian.com/music

  13. The Billboard Hot 100 (US)
  14. https://www.billboard.com/

  15. Apple Music
  16. https://music.apple.com/

Dataflow

Python script querying the Youtube data API service to obtain the metrics. The query is performed automatically at the same time everyday.

Saving data in a Postgresql database

Output

  1. Table in Postgresql database
  2. Dashboard in Tableau Public
  3. Weekly report in .html format (like a Top 50 chart)