We are looking for help from someone who has experience with web crawling/scraping. One example of data we want to extract are articles from the Swedish news site SVT. We do have their permission for using the data but they cannot provide us with a bulk download option.
We also want to collect data from the websites of public health and regulatory authorities.
We do have several people who code in the group (we are a biomedical artificial intelligence lab) but some input from someone who has worked with scrapy or other similar tools would speed things up.