I need a python web scraping tool created for my Alabama news and information website.
The web scraping tool needs to be able to scrape information from multiple links, at least once a day (preferably around 10:00 p.m. Central Time).
I need the results that are scraped from the sites returned in RSS/XML format.
This web scraping tool will be used to scrape news articles about different sports teams from around my coverage area from their athletic sports information department websites. All of the websites are powered by the same company on the same servers with the same server technology. The links that will be inputted into the web scripting tool for scraping the data will be the news archives for the different sports teams for those universities (please see the attached word document for an example of what I'm talking about). The web scraping tool needs to crawl on through to the links provided in the news archives and pull the data from each one of those articles (links) listed, without duplicating an article that has been scraped previously.
Winning freelancer will receive back-end access to my VPS, to be able to set everything up and make sure it's working properly. Any questions please ask.