{"id":1131,"date":"2023-11-13T14:25:15","date_gmt":"2023-11-13T18:25:15","guid":{"rendered":"https:\/\/ccdhhn.ca\/workshop\/web-scraping-with-pythons-beautiful-soup\/"},"modified":"2023-11-17T09:39:30","modified_gmt":"2023-11-17T13:39:30","slug":"web-scraping-with-pythons-beautiful-soup","status":"publish","type":"tribe_events","link":"https:\/\/ccdhhn.ca\/fr\/workshop\/web-scraping-with-pythons-beautiful-soup\/","title":{"rendered":"Web Scraping with Python&rsquo;s Beautiful Soup"},"content":{"rendered":"<div  class=\"tribe-block tribe-block__additional-field tribe-block__additional-field__dropdown\" >\n\t<h3>Event Language<\/h3>\n\tEnglish<\/div>\n\n\n<div  class=\"tribe-block tribe-block__additional-field tribe-block__additional-field__dropdown\" >\n\t<h3>Format <\/h3>\n\tvirtual\/virtuel<\/div>\n\n\n<div  class=\"tribe-events-single-section tribe-events-section-category tribe-clearfix\" >\n\t<dt class=\"tribe-events-event-categories-label\">Cat\u00e9gorie d\u2019\u00c9v\u00e8nement:<\/dt> <dd class=\"tribe-events-event-categories\"><a href=\"https:\/\/ccdhhn.ca\/workshops\/category\/1-4-hour-workshop\/\" rel=\"tag\">1-4 hour workshop<\/a><\/dd><\/div>\n\n\n\n<p>This workshop will introduce attendees to techniques for scraping information from the web using Python\u2019s Beautiful Soup (bs4) toolkit. We will begin with a basic overview of the \u201canatomy\u201d or structure of a webpage. Students will then learn how to write a script for extracting textual data from websites like Reddit and organizing it into spreadsheets. The second half of the workshop will explore how to use Python\u2019s Pandas library to clean and analyze your data. In addition to technical skills, students are encouraged to engage with critical questions like: What is web scraping for and what can we, as researchers, learn from publicly available data? What are the potential ethical and legal challenges of data harvesting, and how do we do it responsibly?<\/p>\n\n\n<div  class=\"tribe-block tribe-block__organizer__details tribe-clearfix\" >\n\t<div class=\"tribe-block__organizer__title\">\n\t\t<h3><a href=\"https:\/\/ccdhhn.ca\/fr\/organisateur\/sherman-centre-for-digital-scholarship\/\" title=\"Sherman Centre for Digital Scholarship\" target=\"_self\" rel=\"\">Sherman Centre for Digital Scholarship<\/a><\/h3>\n\t<\/div>\n\t\t\t\t<p class=\"tribe-block__organizer__website\"><a href=\"https:\/\/scds.ca\/\" target=\"_blank\" rel=\"noopener noreferrer\">View Organisateur Website<\/a><\/p>\n\t\t<\/div>\n\n\n<div  class=\"tribe-block tribe-block__venue tribe-clearfix\" >\n\t\n\t\n<div class=\"tribe-block__venue__meta\">\n\t<div class=\"tribe-block__venue__name\">\n\t\t<h3><a href=\"https:\/\/ccdhhn.ca\/fr\/lieu\/mcmaster-university\/\" title=\"McMaster University\">McMaster University<\/a><\/h3>\n\t<\/div>\n\n\t\n\t\t\t\t\t\t<address class=\"tribe-block__venue__address\">\n\t\t\t\t<span class=\"tribe-address\">\n\n<span class=\"tribe-street-address\">1280 Main St W<\/span>\n\t\n\t\t<br>\n\t\t<span class=\"tribe-locality\">Hamilton<\/span><span class=\"tribe-delimiter\">,<\/span>\n\n\t<abbr class=\"tribe-region tribe-events-abbr\" title=\"Ontario\">Ontario<\/abbr>\n\n\t<span class=\"tribe-postal-code\">L8S 4L8<\/span>\n\n\t<span class=\"tribe-country-name\">Canada<\/span>\n\n<\/span>\n\n\t\t\t\t\t\t\t<\/address>\n\t\t\n\t\t\n\t\t\t\t\t<span class=\"tribe-block__venue__website\"><a href=\"https:\/\/scds.ca\/\" target=\"_self\" rel=\"external\">View Lieu Website<\/a><\/span><br \/>\n\t\t\t\n\t<\/div>\n\t\n\t<\/div>\n\n\n<div  class=\"tribe-block tribe-block__event-website\" >\n\t<a\n\t\thref=\"https:\/\/libcal.mcmaster.ca\/event\/3741241\"\n\t\ttarget=\"_self\"\n\t\t\t>\n\t\tWebsite\t<\/a>\n<\/div>\n\n\n\t<div  class=\"tribe-block tribe-block__events-link\" >\n\t\t<div class=\"tribe-events tribe-common\">\n\t\t\t<div class=\"tribe-events-c-subscribe-dropdown__container\">\n\t\t\t\t<div class=\"tribe-events-c-subscribe-dropdown\">\n\t\t\t\t\t<div class=\"tribe-common-c-btn-border tribe-events-c-subscribe-dropdown__button\">\n\t\t\t\t\t\t<svg\n\t class=\"tribe-common-c-svgicon tribe-common-c-svgicon--cal-export tribe-events-c-subscribe-dropdown__export-icon\" \taria-hidden=\"true\"\n\tviewBox=\"0 0 23 17\"\n\txmlns=\"http:\/\/www.w3.org\/2000\/svg\"\n>\n\t<path fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M.128.896V16.13c0 .211.145.383.323.383h15.354c.179 0 .323-.172.323-.383V.896c0-.212-.144-.383-.323-.383H.451C.273.513.128.684.128.896Zm16 6.742h-.901V4.679H1.009v10.729h14.218v-3.336h.901V7.638ZM1.01 1.614h14.218v2.058H1.009V1.614Z\" \/>\n\t<path d=\"M20.5 9.846H8.312M18.524 6.953l2.89 2.909-2.855 2.855\" stroke-width=\"1.2\" stroke-linecap=\"round\" stroke-linejoin=\"round\"\/>\n<\/svg>\n\t\t\t\t\t\t<button\n\t\t\t\t\t\t\tclass=\"tribe-events-c-subscribe-dropdown__button-text\"\n\t\t\t\t\t\t\taria-expanded=\"false\"\n\t\t\t\t\t\t\taria-controls=\"tribe-events-subscribe-dropdown-content\"\n\t\t\t\t\t\t\taria-label=\"View links to add events to your calendar\"\n\t\t\t\t\t\t>\n\t\t\t\t\t\t\tAjouter au calendrier\t\t\t\t\t\t<\/button>\n\t\t\t\t\t\t<svg\n\t class=\"tribe-common-c-svgicon tribe-common-c-svgicon--caret-down tribe-events-c-subscribe-dropdown__button-icon\" \taria-hidden=\"true\"\n\tviewBox=\"0 0 10 7\"\n\txmlns=\"http:\/\/www.w3.org\/2000\/svg\"\n>\n\t<path fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M1.008.609L5 4.6 8.992.61l.958.958L5 6.517.05 1.566l.958-.958z\" class=\"tribe-common-c-svgicon__svg-fill\"\/>\n<\/svg>\n\t\t\t\t\t<\/div>\n\t\t\t\t\t<div id=\"tribe-events-subscribe-dropdown-content\" class=\"tribe-events-c-subscribe-dropdown__content\">\n\t\t\t\t\t\t<ul class=\"tribe-events-c-subscribe-dropdown__list\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<li class=\"tribe-events-c-subscribe-dropdown__list-item tribe-events-c-subscribe-dropdown__list-item--gcal\">\n\t\t\t\t\t\t\t\t\t<a\n\t\t\t\t\t\t\t\t\t\thref=\"https:\/\/www.google.com\/calendar\/event?action=TEMPLATE&#038;dates=20240215T000000\/20240215T235959&#038;text=Web%20Scraping%20with%20Python%26rsquo%3Bs%20Beautiful%20Soup&#038;details=%3Cp%3EThis+workshop+will+introduce+attendees+to+techniques+for+scraping+information+from+the+web+using+Python%E2%80%99s+Beautiful+Soup+%28bs4%29+toolkit.+We+will+begin+with+a+basic+overview+of+the+%E2%80%9Canatomy%E2%80%9D+or+structure+of+a+webpage.+Students+will+then+learn+how+to+write+a+script+for+extracting+textual+data+from+websites+like+Reddit+and+organizing+it+into+spreadsheets.+The+second+half+of+the+workshop+will+explore+how+to+use+Python%E2%80%99s+Pandas+library+to+clean+and+analyze+your+data.+In+addition+to+technical+skills%2C+students+are+encouraged+to+engage+with+critical+questions+like%3A+What+is+web+scraping+for+and+what+can+we%2C+as+researchers%2C+learn+from+publicly+available+data%3F+What+are+the+potential+ethical+and+legal+challenges+of+data+harvesting%2C+and+how+do+we+do+it+responsibly%3F%3C%2Fp%3E+&#038;location=McMaster%20University,%201280%20Main%20St%20W,%20Hamilton,%20Ontario,%20L8S%204L8,%20Canada&#038;trp=false&#038;ctz=America\/Halifax&#038;sprop=website:https:\/\/ccdhhn.ca\/fr\/\"\n\t\t\t\t\t\t\t\t\t\tclass=\"tribe-events-c-subscribe-dropdown__list-item-link\"\n\t\t\t\t\t\t\t\t\t\ttarget=\"_blank\"\n\t\t\t\t\t\t\t\t\t\trel=\"noopener noreferrer nofollow noindex\"\n\t\t\t\t\t\t\t\t\t>\n\t\t\t\t\t\t\t\t\t\tGoogle Agenda\t\t\t\t\t\t\t\t\t<\/a>\n\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<li class=\"tribe-events-c-subscribe-dropdown__list-item tribe-events-c-subscribe-dropdown__list-item--ical\">\n\t\t\t\t\t\t\t\t\t<a\n\t\t\t\t\t\t\t\t\t\thref=\"\"\n\t\t\t\t\t\t\t\t\t\tclass=\"tribe-events-c-subscribe-dropdown__list-item-link\"\n\t\t\t\t\t\t\t\t\t\ttarget=\"_blank\"\n\t\t\t\t\t\t\t\t\t\trel=\"noopener noreferrer nofollow noindex\"\n\t\t\t\t\t\t\t\t\t>\n\t\t\t\t\t\t\t\t\t\tiCalendar\t\t\t\t\t\t\t\t\t<\/a>\n\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<li class=\"tribe-events-c-subscribe-dropdown__list-item tribe-events-c-subscribe-dropdown__list-item--outlook-365\">\n\t\t\t\t\t\t\t\t\t<a\n\t\t\t\t\t\t\t\t\t\thref=\"\"\n\t\t\t\t\t\t\t\t\t\tclass=\"tribe-events-c-subscribe-dropdown__list-item-link\"\n\t\t\t\t\t\t\t\t\t\ttarget=\"_blank\"\n\t\t\t\t\t\t\t\t\t\trel=\"noopener noreferrer nofollow noindex\"\n\t\t\t\t\t\t\t\t\t>\n\t\t\t\t\t\t\t\t\t\tOutlook 365\t\t\t\t\t\t\t\t\t<\/a>\n\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<li class=\"tribe-events-c-subscribe-dropdown__list-item tribe-events-c-subscribe-dropdown__list-item--outlook-live\">\n\t\t\t\t\t\t\t\t\t<a\n\t\t\t\t\t\t\t\t\t\thref=\"\"\n\t\t\t\t\t\t\t\t\t\tclass=\"tribe-events-c-subscribe-dropdown__list-item-link\"\n\t\t\t\t\t\t\t\t\t\ttarget=\"_blank\"\n\t\t\t\t\t\t\t\t\t\trel=\"noopener noreferrer nofollow noindex\"\n\t\t\t\t\t\t\t\t\t>\n\t\t\t\t\t\t\t\t\t\tOutlook Live\t\t\t\t\t\t\t\t\t<\/a>\n\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/ul>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n\n","protected":false},"excerpt":{"rendered":"<p>This workshop will introduce attendees to techniques for scraping information from the web using Python\u2019s Beautiful Soup (bs4) toolkit. We will begin with a basic overview of the \u201canatomy\u201d or structure of a webpage. Students will then learn how to write a script for extracting textual data from websites like Reddit and organizing it into spreadsheets. The second half of the workshop will explore how to use Python\u2019s Pandas library to clean and analyze your data. In addition to technical skills, students are encouraged to engage with critical questions like: What is web scraping for and what can we, as researchers, learn from publicly available data? What are the potential ethical and legal challenges of data harvesting, and how do we do it responsibly?<\/p>\n","protected":false},"author":2,"featured_media":1435,"template":"","meta":{"_tec_requires_first_save":false,"_EventAllDay":true,"_EventTimezone":"America\/Halifax","_EventStartDate":"2024-02-15 00:00:00","_EventEndDate":"2024-02-15 23:59:59","_EventStartDateUTC":"2024-02-15 04:00:00","_EventEndDateUTC":"2024-02-16 03:59:59","_EventShowMap":true,"_EventShowMapLink":true,"_EventURL":"https:\/\/libcal.mcmaster.ca\/event\/3741241","_EventCost":"","_EventCostDescription":"","_EventCurrencySymbol":"","_EventCurrencyCode":"","_EventCurrencyPosition":"prefix","_EventDateTimeSeparator":"","_EventTimeRangeSeparator":"","_EventOrganizerID":[1054],"_EventVenueID":[1063],"_OrganizerEmail":"","_OrganizerPhone":"","_OrganizerWebsite":"","_VenueAddress":"","_VenueCity":"","_VenueCountry":"","_VenueProvince":"","_VenueState":"","_VenueZip":"","_VenuePhone":"","_VenueURL":"","_VenueStateProvince":"","_VenueLat":"","_VenueLng":"","_VenueShowMap":false,"_VenueShowMapLink":false,"_tribe_blocks_recurrence_rules":"","_tribe_blocks_recurrence_description":"","_tribe_blocks_recurrence_exclusions":"","_ecp_custom_2":"English","_ecp_custom_3":"virtual\/virtuel","_tribe_events_status":"","_tribe_events_status_reason":"","_tribe_events_is_hybrid":"","_tribe_events_is_virtual":"","_tribe_events_virtual_video_source":"","_tribe_events_virtual_embed_video":"","_tribe_events_virtual_linked_button_text":"","_tribe_events_virtual_linked_button":"","_tribe_events_virtual_show_embed_at":"","_tribe_events_virtual_show_embed_to":[],"_tribe_events_virtual_show_on_event":"","_tribe_events_virtual_show_on_views":"","_tribe_events_virtual_url":"","footnotes":""},"tags":[],"tribe_events_cat":[3],"class_list":["post-1131","tribe_events","type-tribe_events","status-publish","has-post-thumbnail","hentry","tribe_events_cat-1-4-hour-workshop","cat_1-4-hour-workshop"],"_links":{"self":[{"href":"https:\/\/ccdhhn.ca\/fr\/wp-json\/wp\/v2\/tribe_events\/1131","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ccdhhn.ca\/fr\/wp-json\/wp\/v2\/tribe_events"}],"about":[{"href":"https:\/\/ccdhhn.ca\/fr\/wp-json\/wp\/v2\/types\/tribe_events"}],"author":[{"embeddable":true,"href":"https:\/\/ccdhhn.ca\/fr\/wp-json\/wp\/v2\/users\/2"}],"version-history":[{"count":2,"href":"https:\/\/ccdhhn.ca\/fr\/wp-json\/wp\/v2\/tribe_events\/1131\/revisions"}],"predecessor-version":[{"id":1141,"href":"https:\/\/ccdhhn.ca\/fr\/wp-json\/wp\/v2\/tribe_events\/1131\/revisions\/1141"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/ccdhhn.ca\/fr\/wp-json\/wp\/v2\/media\/1435"}],"wp:attachment":[{"href":"https:\/\/ccdhhn.ca\/fr\/wp-json\/wp\/v2\/media?parent=1131"}],"wp:term":[{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ccdhhn.ca\/fr\/wp-json\/wp\/v2\/tags?post=1131"},{"taxonomy":"tribe_events_cat","embeddable":true,"href":"https:\/\/ccdhhn.ca\/fr\/wp-json\/wp\/v2\/tribe_events_cat?post=1131"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}