Archiving SDII stream data in Protobuf

This example shows how to use the Data Archiving Library to quickly develop an archiving solution that archives data in Protobuf format. There are two options for using this example:

  1. Use the example as-is. You can run the example out of the box by creating an input and output catalog, then specifying those catalogs in the application.conf file. For information on configuring this file, see the Update the Configuration File section below.
  2. Create your own archiving application using the example as a template.

You should review this entire readme regardless of whether you are running the example as-is or using the example as a template for your own application.

The example consists of one user-defined function (UDF) implementation example class:

  • ProtobufSimpleKeyExample.java

This class implements the SimpleUDF interface (one value per indexing attribute) from the Data Archiving Library. For details on this and other interfaces, see the API Reference section of the Data Archiving Library Developer Guide.
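
As a rough, purely illustrative sketch of what such a UDF does conceptually, the snippet below derives one value per indexing attribute (ingestionTime, tileId, eventType) for a message and returns them as a map. The class and method names here are hypothetical and do not reproduce the actual SimpleUDF signatures; see the API Reference for the real interface and ProtobufSimpleKeyExample.java for the working implementation.

import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch only: this is NOT the real SimpleUDF interface from the
// Data Archiving Library; see the API Reference for the actual signatures.
public class IndexingAttributesSketch {

    // Derive one value per indexing attribute for a single incoming message.
    // The attribute names match the index definitions used in this example:
    // ingestionTime (timewindow), tileId (heretile), eventType (string).
    public Map<String, Object> toIndexAttributes(long ingestionTimeMillis,
                                                 long hereTileId,
                                                 String eventType) {
        Map<String, Object> attributes = new HashMap<>();
        attributes.put("ingestionTime", ingestionTimeMillis);
        attributes.put("tileId", hereTileId);
        attributes.put("eventType", eventType);
        return attributes;
    }

    public static void main(String[] args) {
        Map<String, Object> attributes = new IndexingAttributesSketch()
                .toIndexAttributes(System.currentTimeMillis(), 92259L, "SIGN_RECOGNITION");
        System.out.println(attributes);
    }
}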

This readme contains important instructions that will help you to run the Data Archiving Library examples.

Get Your Credentials

To run this example, you need two sets of credentials:

  • Platform credentials: To get access to the platform data and resources, including HERE Map Content data for your pipeline input.
  • Repository credentials: To download HERE Data SDK for Java & Scala libraries and Maven archetypes to your environment.

For more details on how to set up your credentials, see the Identity & Access Management Developer Guide.

For more details on how to verify that your platform credentials are configured correctly, see the Verify Your Credentials tutorial.

Configure a Project

To follow this example, you will need a project. A project is a collection of platform resources (catalogs, pipelines, and schemas) with controlled access. You can create a project through the HERE platform portal.

Alternatively, use the OLP CLI olp project create command to create the project:

olp project create $PROJECT_ID $PROJECT_NAME

The command returns the HERE Resource Name (HRN) of your new project. Note down this HRN as you will need it later in this tutorial.

Note

You do not have to provide a --scope parameter if your app has a default scope. For details on how to set a default project scope for an app, see the Specify a default Project for Apps chapter of the Identity & Access Management Developer Guide.

For more information on how to work with projects, see the Organize your work in projects tutorial.

Create Input Catalog and Layer

The examples require you to have a catalog with a stream layer for the input data. Create a new catalog with a stream layer or use an existing one.

Use the HERE platform portal to create the input catalog in your project and add the following layers:

| Layer ID | Layer Type | Content Type | Content Encoding | Coverage |
|----------|------------|--------------|------------------|----------|
| stream | Stream | application/octet-stream | uncompressed | - |

Alternatively, you can use the OLP CLI (see OLP CLI Commands in the Command Line Interface Developer Guide) instead of the platform portal to create a new catalog with a stream layer:

Note

We recommend setting these values as shell variables so that you can easily copy and execute the following commands.
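
A minimal sketch of such variables, with hypothetical values that you should replace with your own identifiers and the HRN returned when you created your project:

# Hypothetical values; replace them with your own identifiers and HRNs.
export PROJECT_HRN="<your-project-hrn>"
export CATALOG_ID=sdii-archive-input
export CATALOG_NAME="SDII archive input"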

  1. Use the olp catalog create command to create the catalog. Make sure to note down the HRN returned by the following command for later use:
olp catalog create $CATALOG_ID $CATALOG_NAME --summary CATALOG_SUMMARY \
        --description CATALOG_DESCRIPTION --scope $PROJECT_HRN
  2. Use the olp catalog layer add command to add a stream layer to your catalog:
olp catalog layer add $CATALOG_HRN stream stream --stream --summary "stream" \
        --description "stream" --content-type application/octet-stream \
        --scope $PROJECT_HRN

Create Output Catalog and Layer

The examples require that you define indexing attributes with the following index definitions for the index layer:

  • ingestionTime should be declared as the timewindow type. Every index layer must have exactly one timewindow column; it is mandatory.
  • tileId should be declared as the heretile type. A heretile column is optional, and there can be at most one.
  • eventType should be declared as string.

Use the HERE platform portal to create the output catalog in your project and add the following layers:

| Layer ID | Layer Type | Retention | Timewindow Attribute Name | Duration | Content Type | Content Encoding | Coverage |
|----------|------------|-----------|---------------------------|----------|--------------|------------------|----------|
| index | Index | 7 days | ingestionTime | 60 | application/x-protobuf | uncompressed | - |

Alternatively, you can use the OLP CLI (see OLP CLI Commands in the Command Line Interface Developer Guide) instead of the platform portal to create a new catalog with an index layer:

  1. Use the olp catalog create command to create the catalog. Make sure to note down the HRN returned by the following command for later use:
olp catalog create $CATALOG_ID $CATALOG_NAME --summary CATALOG_SUMMARY \
        --description CATALOG_DESCRIPTION --scope $PROJECT_HRN
  2. Use the olp catalog layer add command to add an index layer to your catalog:
olp catalog layer add $CATALOG_HRN index index --index --summary "index" \
        --description "index" --content-type application/x-protobuf --ttl 7.days \
        --index-definitions tileId:heretile:8 ingestionTime:timewindow:3600000 eventType:string \
        --scope $PROJECT_HRN

Update the Configuration File

The configuration is specified in the application.conf file, which can be found in the src/main/resources directory of the example project. You must modify it to configure the expected behavior of the data archiving application. For information about modifying this file, see the comments in the application.conf file.

Note

You must configure this file in order for the examples to run successfully.

Note

The property aggregation.window-seconds is different from the index attribute of type timewindow. The property aggregation.window-seconds determines how frequently data will be aggregated and processed by the data archiving pipeline. The index attribute of type timewindow contains the finest time granularity at which the data will be indexed and later queried, as specified in the attribute's duration field.
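
As a purely illustrative sketch (only the aggregation.window-seconds key is named in this readme; the value below is an example, and the shipped application.conf documents the full set of keys), aggregating and processing incoming messages once per minute would look like this:

# Illustrative excerpt only; see the comments in application.conf for all required settings.
aggregation.window-seconds = 60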

Package the Application Into a Fat JAR

To run the archiving pipeline on the HERE platform, you need to build a fat JAR.

You can refer to the pom.xml file for packaging details if you are creating your own application. If you are not creating your own application, run the following command under the example's base directory.

mvn clean package

Once the above command is successful, a fat JAR named data-archive-protobuf-example-<VERSION>-platform.jar will be built in the target folder.

Create a Pipeline on the Platform

To run the example, create a pipeline in the HERE platform to execute the application.

Configure and Run as a Platform Pipeline

Use the application JAR with the platform.jar suffix as the pipeline template to run the archiving process. The pipeline uses the configuration from the application.conf file in the src/main/resources directory, so make sure these values are updated before compiling and uploading your JAR.

Using the Platform Portal to Run a Pipeline

For information on using the platform portal to configure and run a pipeline, see Deploying a Pipeline via Web Platform Portal in the Pipelines Developer Guide. Update the logging level of your pipeline from WARN to INFO if you intend to verify message upload in logs.

Use the Command Line Interface to Run a Pipeline

You can use the OLP CLI (see OLP CLI Commands in the Command Line Interface Developer Guide) to create the pipeline components and activate the pipeline version.

First, configure data sources using the config/pipeline-config.conf file. This file contains the configuration of the data sources which are used as placeholders for Data Archiving Library examples:

pipeline.config {
  output-catalog {hrn = "YOUR_OUTPUT_CATALOG_HRN"}
  input-catalogs {
    source {hrn = "YOUR_INPUT_CATALOG_HRN"}
  }
}

You must replace YOUR_OUTPUT_CATALOG_HRN with the HRN of your output catalog and YOUR_INPUT_CATALOG_HRN with the HRN of your input catalog. To find the HRN, in the HERE platform portal navigate to your catalog. The HRN is displayed in the upper left corner of the page.
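
For example, with hypothetical HRNs (the realm myrealm and the catalog names below are placeholders), the file might look like this:

pipeline.config {
  output-catalog {hrn = "hrn:here:data::myrealm:sdii-archive-output"}
  input-catalogs {
    source {hrn = "hrn:here:data::myrealm:sdii-archive-input"}
  }
}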

You can use the OLP CLI to create pipeline components and activate the pipeline version with the following commands:

  1. Create pipeline components:
 olp pipeline create $PIPELINE_NAME --scope $PROJECT_HRN
 olp pipeline template create $PIPELINE_TEMPLATE_NAME stream-4.0 $PATH_TO_JAR com.here.platform.dal.DALMain \
      --input-catalog-ids=source --workers=2 --scope $PROJECT_HRN # Note that the value of workers should be greater than or equal to the value of parallelism selected in application.conf
 olp pipeline version create $PIPELINE_VERSION_NAME $PIPELINE_ID $PIPELINE_TEMPLATE_ID \
      "$PATH_TO_CONFIG_FOLDER/pipeline-config.conf" --scope $PROJECT_HRN
  2. Set the log level:
olp pipeline version log level set $PIPELINE_ID $PIPELINE_VERSION_ID --root info --scope $PROJECT_HRN
  3. Activate the pipeline version:
olp pipeline version activate $PIPELINE_ID $PIPELINE_VERSION_ID --scope $PROJECT_HRN

For more information on using the OLP CLI to configure and run a pipeline, see Pipeline Commands in the Command Line Interface Developer Guide.

Note

The input and output catalog values selected in the portal when creating a pipeline version, or added to the pipeline-config.conf file when using the command line interface, must be valid. However, they are only placeholders that the Data Archiving Library does not use; the actual values are taken from the application.conf file. If you want to change or update the input/output catalogs, modify the application.conf file and rebuild.

Verify the Output

In the HERE platform portal, select the Pipelines tab and find your pipeline.

  • Verify that the pipeline is in the Running state.

After the pipeline is running, you can ingest your data into the stream layer created in the Create Input Catalog and Layer section, for example with the OLP CLI:

Note

To ingest data, make sure your app has read and write permissions on your stream layer. For instructions on how to manage your app, see Manage Apps in the Identity & Access Management Developer Guide.

olp catalog permission grant $CATALOG_HRN --app $APP_ID --read --write --scope $PROJECT_HRN
olp catalog layer stream put $CATALOG_HRN stream --input /path/to/directory --scope $PROJECT_HRN # There are sdii example messages in the folder: src/test/resources/sampleData

After your data is archived in the index layer, you can query and retrieve it; for the available options, see the Data Archiving Library Developer Guide.

To parse the data retrieved from the index layer, see "How to parse the output content" in the FAQ of the Data Archiving Library Developer Guide.

Run Example Locally

Besides running this example in a pipeline, you can also run the archiving application locally; see Run an Archiving Application Locally in the Data Archiving Library Developer Guide. Running locally in your IDE or on a Flink cluster uses the configuration from the application.conf file in the src/test/resources directory, so make sure these values are updated before compiling and running the application. To create a new application and get credentials.properties, see Manage Apps in the Identity & Access Management Developer Guide. You will grant this app read/write permissions when you create your stream and index layers.

Note

This example provides the Maven profile add-dependencies-for-IDEA, which includes the dependencies necessary to run the application locally from your IDE. When running this example from your IDE, ensure that this profile is enabled in the Maven Toolbar. Alternatively, you can select the Include dependencies with "Provided" scope checkbox in Edit Configurations for ProtobufExampleRunner.java.

Troubleshooting

If you have any trouble accessing logs, monitoring, or investigating failures, see the FAQ in the Data Archiving Library Developer Guide.

References