Webscraping IMDB by Nur Davlatov

Padlet board: https://padlet.com/ndavlatow/q3c7tf36ffosb5j3 (posted by ndavlatow, 2021-08-05)

What is the goal of our project?

We wanted to build a project that helps us organize the data shown on the IMDB Top 1000 movies page. Our goal is to sort the different variables into groups such as title, ranking, gross revenue, etc.

How did we do it?

We started by creating a Python project and inspecting the website. In the project we imported BeautifulSoup, pandas, and NumPy. We then sent a request to the IMDB URL to fetch the webpage and used BeautifulSoup to access the content we wanted to extract. Afterwards we initialized the lists for the information we wanted from the IMDB website: titles, years, runtimes, IMDB ratings, metascores, votes, and gross revenue. Then we looked up the class we needed in the page's HTML.

How we proceeded

We stored every movie's div container in movie_div and then looped over it to collect the data for each movie.
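The parsing and loop steps described above could look roughly like the sketch below. It is not the project's actual code: the HTML snippet, the class names (lister-item, year, runtime, rating, metascore, votes, gross), the two example movies, and the helper text_or_none are all made up for illustration, and the real class names have to be read off the live page with the browser inspector. Only movie_div and the set of lists come from the description. In the project the HTML came from a request to the IMDB URL; here a string is parsed so the sketch runs offline.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for the downloaded page. The tag and class
# names are assumptions, not the site's real structure.
SAMPLE_HTML = """
<div class="lister-item">
  <h3><a>Example Movie A</a> <span class="year">(1994)</span></h3>
  <span class="runtime">142 min</span>
  <span class="rating">9.3</span>
  <span class="metascore">82</span>
  <span class="votes">2,500,000</span>
  <span class="gross">$28.34M</span>
</div>
<div class="lister-item">
  <h3><a>Example Movie B</a> <span class="year">(2001)</span></h3>
  <span class="runtime">95 min</span>
  <span class="rating">8.1</span>
  <span class="votes">12,345</span>
</div>
"""


def text_or_none(container, css_class):
    """Return the stripped text of the first matching span, or None if absent."""
    tag = container.find("span", class_=css_class)
    return tag.get_text(strip=True) if tag is not None else None


def scrape(html):
    """Collect one list per variable from every movie container in the HTML."""
    soup = BeautifulSoup(html, "html.parser")

    # One list per variable named in the post.
    titles, years, runtimes, imdb_ratings = [], [], [], []
    metascores, votes, gross = [], [], []

    # Store every movie's div container, then loop over the containers.
    movie_div = soup.find_all("div", class_="lister-item")
    for container in movie_div:
        titles.append(container.h3.a.get_text(strip=True))
        years.append(text_or_none(container, "year"))
        runtimes.append(text_or_none(container, "runtime"))
        imdb_ratings.append(text_or_none(container, "rating"))
        # Some movies lack a metascore or gross figure, so None is stored.
        metascores.append(text_or_none(container, "metascore"))
        votes.append(text_or_none(container, "votes"))
        gross.append(text_or_none(container, "gross"))

    return {
        "titles": titles,
        "years": years,
        "runtimes": runtimes,
        "imdb_ratings": imdb_ratings,
        "metascores": metascores,
        "votes": votes,
        "gross": gross,
    }


data = scrape(SAMPLE_HTML)
print(data["titles"])
```

Storing None for a missing field keeps all seven lists the same length, which matters once they are combined into one table.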

We then built our pandas DataFrame.
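As a minimal sketch of this step, assuming the scraped values sit in equally long Python lists: the column names and the example values below are invented for illustration and are not taken from the project.

```python
import pandas as pd

# Made-up values standing in for the lists filled by the scraping loop.
titles = ["Example Movie A", "Example Movie B"]
years = ["(1994)", "(2001)"]
runtimes = ["142 min", "95 min"]
imdb_ratings = ["9.3", "8.1"]
metascores = ["82", None]
votes = ["2,500,000", "12,345"]
gross = ["$28.34M", None]

# Each list becomes one column; the column names are hypothetical.
movies = pd.DataFrame({
    "movie": titles,
    "year": years,
    "runtime_min": runtimes,
    "imdb": imdb_ratings,
    "metascore": metascores,
    "votes": votes,
    "gross_millions": gross,
})

print(movies)
```

At this point every column still holds raw text, which is why a cleaning pass is needed before the data is useful.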

After collecting the data, we had to clean it with pandas. After that, we only needed to export the data to a CSV file, and that was it.

The Result

Now we can see the variables we were looking for in an ordered table. This makes it easy to look up different variables for the movies.
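The cleaning and export step mentioned above might be sketched as follows. The post does not say which cleaning operations were applied, so the ones shown (stripping parentheses, units, thousands separators, and currency symbols, then converting to numbers) are assumptions, as are the column names, the example rows, and the clean helper.

```python
import pandas as pd

# Made-up raw rows in the text form a scraper would typically return.
raw = pd.DataFrame({
    "movie": ["Example Movie A", "Example Movie B"],
    "year": ["(1994)", "(2001)"],
    "runtime_min": ["142 min", "95 min"],
    "imdb": ["9.3", "8.1"],
    "metascore": ["82", None],
    "votes": ["2,500,000", "12,345"],
    "gross_millions": ["$28.34M", None],
})


def clean(df):
    """Return a copy of df with the text columns converted to numbers."""
    out = df.copy()
    # Keep only the four-digit year, e.g. "(1994)" -> 1994.
    out["year"] = out["year"].str.extract(r"(\d{4})", expand=False).astype(int)
    # Keep only the digits of the runtime, e.g. "142 min" -> 142.
    out["runtime_min"] = out["runtime_min"].str.extract(r"(\d+)", expand=False).astype(int)
    out["imdb"] = out["imdb"].astype(float)
    # Missing metascores become NaN instead of raising an error.
    out["metascore"] = pd.to_numeric(out["metascore"], errors="coerce")
    # Drop thousands separators, e.g. "2,500,000" -> 2500000.
    out["votes"] = out["votes"].str.replace(",", "", regex=False).astype(int)
    # Drop "$" and "M", e.g. "$28.34M" -> 28.34; missing values become NaN.
    out["gross_millions"] = pd.to_numeric(
        out["gross_millions"].str.replace(r"[$M]", "", regex=True),
        errors="coerce",
    )
    return out


cleaned = clean(raw)

# With no path, to_csv returns the CSV as a string; passing a file name such
# as "movies.csv" (hypothetical) would write it to disk instead.
csv_text = cleaned.to_csv(index=False)
print(csv_text)
```

Converting with errors="coerce" turns missing or unparseable values into NaN, so one incomplete row does not stop the whole export.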