Since we use data from youtube API, we will use a notebook to find out what is the result from the API, and what we can use?
From above information, we determine which API endpoint we will use.
Resources | Methods | Parameters | Quota Impact | Result |
---|---|---|---|---|
Search | search.list | snippet | 100 units per request | Up to 50 results per request |
Videos | videos.list | snippet | 1 unit per part per request | Up to 50 results per request |
Videos | videos.list | statistics | 1 unit per part per request | Up to 50 results per request |
Videos | videos.list | contentDetails | 1 unit per part per request | Up to 50 results per request |
Channels | snippet | |||
statistics | ||||
topicDetails | ||||
contentDetails |
YouTube allows free accounts to have 10,000 quota units per day, meaning:
Although we cannot get the caption of video through youtube offical api (require permission to edit a video to download caption), we can get caption of video through youtube_transcript_api free, but have some limited in request time per hours and IPBlock. These disadvantages can be bypassed by time.sleep() and use Kaggle, Colab
In the first stage of the project, we will focus on find out what current trends, popular channel, brief attitiude, so we will focus search (to get result from search), video, channel API.
Stage of the project