-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement OODT Crawler for injecting staged products into File Manager #8
Comments
This can be done as follows
The output is as follows
The product is then ingested and looks as follows
As you can see this is much better but it is not quite what we want for example
|
We need to update to a newer version of Apache Tika in order to achieve the above. Right now we use Apache Tika versions
We should be using the most recent version 1.19 |
This can be achieved as follows
I would like someone to write this up so we will discuss this tomorrow. |
This now exists at https://github.com/capstone-coal/coal-sds/wiki/Automation-of-Data-Acquisition-and-Crawling Thank you folks :) |
Once we have implemented #7 we need to look at implementing OODT's Crawler component for injecting staged products into the File Manager.
The text was updated successfully, but these errors were encountered: