Contra Costa news has obfuscated URLs #173
Labels
enhancement
New feature or request
help wanted
Extra attention is needed
news
Related to scraping news (rather than data)
Some news items in Contra Costa are coming through with obfuscated URLs like:
It might be good to check any URLs that aren’t under the
cchealth.org
orcontracosta.ca.gov
domains and, if they are redirects, substitute the redirect target for the original URL we had. So, for example, the example item above would wind up with the URLhttps://youtu.be/PZXjV4tFFdA
.Nice to have here: while we’re at it, we could add a
youtube
orvideo
tag to news items that link to YouTube.The text was updated successfully, but these errors were encountered: