Implement scrape_question method for scraping info from ROS Answers question URLs #5

ChrisTimperley · 2020-06-29T15:19:58Z

~~Depends on the completion #4~~

You should use requests to fetch the contents of the web page, and lxml to parse the HTML contents into a DocumentTree. You can then use XPath queries to extract the information that you need from the page: https://docs.python-guide.org/scenarios/scrape/

Edit: We can use the AskBot API to programmatically scrape ROS Answers [https://github.com/ASKBOT/askbot-devel/blob/master/askbot/doc/source/api.rst]

def scrape_question(id: str) -> Dict[str, Any]:
    ...

Should use the following API end point: https://answers.ros.org/api/v1/questions/299232/

Requests should be used to interact with the AskBot API

The text was updated successfully, but these errors were encountered:

ChrisTimperley added enhancement New feature or request high-priority labels Jun 29, 2020

ChrisTimperley changed the title ~~Implement url_to_question method for scraping info from ROS Answers question URLs~~ Implement scrape_question method for scraping info from ROS Answers question URLs Jul 2, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement scrape_question method for scraping info from ROS Answers question URLs #5

Implement scrape_question method for scraping info from ROS Answers question URLs #5

ChrisTimperley commented Jun 29, 2020 •

edited

Loading

Implement scrape_question method for scraping info from ROS Answers question URLs #5

Implement scrape_question method for scraping info from ROS Answers question URLs #5

Comments

ChrisTimperley commented Jun 29, 2020 • edited Loading

ChrisTimperley commented Jun 29, 2020 •

edited

Loading