Sorry if that title seemed stupid. The super serious title would be "Creating a REST API in Rust using Rocket and Diesel", but thats boring. Anyway...
Here I go with my first post that fully focuses on Rust. After spending a few months doing a bit here and there I decided to just dive right in as I was going through the Rust book at too slow a pace to keep myself interested. So, in this post I decided to write about setting up a simple REST API which is something that I have done in Java plenty of times but with Rust it is a different story.
Anyway, enough with this personal backstory and onto the actual tutorial.
In this post we will be looking creating a REST API in Rust. To do this we will useRocket to setup the API and Diesel to deal with the database.
At the time of writing this post the only databases that Diesel accommodates are Postgres, MySql and Sqlite.
Before we can begin coding we need to sort out our dependencies.[gist https://gist.github.com/lankydan/2b72b641b905ff37af99a512fa5638bd /]
As you can see there's a reasonable amount of crates being used here. Obviously we need the rocket
and diesel
crates whereas the rest are not yet clear. rocket_codegen
is pulled in for some macros. dotenv
to allow us to retrieve environment variables from an external file. r2d2
and r2d2-diesel
for connection pooling to connect to the database, specifically via diesel. Finally, serde
, serde_derive
, serde_json
for serialisation and deserialisation of data that is sent and received by the REST API. One extra note about the diesel
dependency, postgres
has been specified explicitly to include only Postgres modules in the Diesel crate, if we wanted to use a different database or even multiple types within the project we just need to specify them or remove the features
list all together.
There is one last piece of information we need before we can continue. To use Rocket, we must be using a nightly build of Rust since it relies on features not yet included in the stable builds.
I think the best place to start with is setting up Diesel. Once that's done we will have our schema defined (only one table for this post) which we can then use to build up our application.For the purpose of this post I will assume that you have already setup the Diesel CLI. A quick example on how to use it can be found in Diesel's getting started guide along with other information in how to use it. I personally used Postgres solely due to not being able to get everything I needed to run MySQL, which seemed to stem from me using Windows... Postgres on the other hand was nice and easy to get going.
First set theDATABASE_URL
to connect to Postgres with the below command or by adding it to the .env
file manually:
echo DATABASE_URL=postgres://postgres:password@localhost/rust-web-with-rocket > .envHopefully your username and password differs from mine!
Then run diesel setup
to create a database for the project and an empty migrations folder for later use.
For this post, we will be modelling people who can be: inserted, retrieved, updated and deleted from the database. To do this we are going to first need a table to store them in. So lets create our first migration.
diesel migration generate create_people
This creates two new files within a single folder which are then placed in the migrations directory. up.sql
is for upgrading and is where we want to put the SQL to create the table. down.sql
is for downgrading so we can undo the upgrade if necessary, therefore for this example it will drop the people table.
To create the people table we run:
[gist https://gist.github.com/lankydan/40800defe7f02485988bddb5b97477a3 /]
And to undo this creation:
[gist https://gist.github.com/lankydan/7b7bbaa18e8ea1e05fe754f5e4d9f179 /]
To apply this migration we need to run:
diesel migration run
And if we need to undo it right away:
diesel migration redoAt this point we have a people table which we can start inserting data into. Since Diesel is an ORM we are obviously going to start mapping the table to something that represents the it in Rust. To do just that we will use a struct.
[gist https://gist.github.com/lankydan/6ca1eb5d800ce430432e476061a88034 /]
Below is the struct that represents each record in the people table; otherwise named a person. Since I only want this struct to represent a record in the table I decided to provide it with no logic and therefore it does not have a impl
section. There are three Diesel specific attributes here: #[derive(Queryable)]
, #[derive(AsChangeSet)]
and #[table_name]
. #[derive(Queryable)]
will generate the code to retrieve a person from the database and #[derive(AsChangeSet)]
to allow us to use update.set
later on. Finally, #[table_name = "people"]
is required since the plural of person in not people. If this struct was called post and the table posts, like in the Diesel getting started example, the attribute can be removed since the plural of post is posts which matches the table name.
The other attributes are aptly named; #[derive(Serialize)]
and #[derive(Deserialize)]
. These are for accepting/returning JSON into/from the REST API. They both come from the serde
crate. We will look at this more later on in the post.
Before we move any further, we should look at creating our schema. Not a database schema for Postgres, a Rust schema file that uses the table!
macro that does the actual Rust to database mappings for us. If we run the following command:
diesel print-schema > src/schema.rs
The following file is generated:
table! { people (id) { id -> Int4, first_name -> Varchar, last_name -> Varchar, age -> Int4, profession -> Varchar, salary -> Int4, } }
For now we can just ignore this file and carry on going.
Using the Person
struct defined above, we can execute SELECT
and UPDATE
queries. DELETE
doesn't require a struct to map to since we just require the record's ID. Then what about INSERT
? For convenience, Diesel suggests doing it this way, we will use another struct with the sole purpose of being used for inserts.
[gist https://gist.github.com/lankydan/99adc54f48c4cf066446a1fd73e9d11f /]
InsertablePerson
is nearly identical to the Person
struct but with one difference, the id
field is missing. This is because the ID of the record will be generated automatically when inserted, so we have no need to set it ourselves. Other fields could also differ slightly, if we don't want some other fields being set on creation. Similar to the Person
's attributes #[derive(Insertable)]
is added generate the code to insert a new record.
I have also included an utility function from_person
which takes a Person
struct's values and converts it into an InsertablePerson
. This simply removes the id
field in this scenario and allows me to have tidier code in other places. This function isn't 100% necessary and is added due to my coding preferences.
[gist https://gist.github.com/lankydan/bbf521dfb3d5f5380326274a51db49e3 /]
The diesel
module is used to access the insert_into
, update
and delete
functions. diesel::prelude::*
provides access to a range of modules and structs that are generally useful when using Diesel, for this example; PgConnection
and QueryResult
are included in this list. schema::people
is included so we can access the people table from within Rust and execute methods on it. Note that schema::people
is referring back to the people table defined in the schema.rs
file we generated earlier.
Let's look at one of the functions more closely:
[gist https://gist.github.com/lankydan/84ccaf17a2bb8db3fb5abb9acdc0a2d1 /]
As mentioned above, we can access the people table via people::table
thanks to including schema::people
. This example is nice and easy, find
is specified as the query that selects a single record with the provided ID and get_result
executes the query with the connection provided to it.
In my examples QueryResult
is returned from all functions. Diesel returns QueryResult<T>
from most methods and is shorthand for Result<T, Error>
due to the following line:
[gist https://gist.github.com/lankydan/702de21e950da0c0711040b706fd869f /]
Returning QueryResult
allows us to determine what happens if the query fails in whatever way is suitable for where the function is used. If we wanted to return a Person
directly out of the function we could call expect
to log the error there and then.
Also, since I have used Postgres for this post, PgConnection
is used. If we were using one of the other databases Diesel support; MySql for example, MysqlConnection
would be used instead.
Let's look at another one:
[gist https://gist.github.com/lankydan/2a530bf568823ce7724ba3721e1a58d2 /]
This works slightly differently to the earlier get
function. Rather than accessing a function on the people::table
it is passed into another Diesel function, insert_into
. As I mentioned earlier in the post, InsertablePerson
was defined specifically for new records, therefore the values from person
are extracted thanks to the from_person
helper function. Remember that no ID is included on this struct. Like before, get_result
is called again to execute the statement.
PgConnection
come from? Well, let's have a look.
The code below shows how a connection pool is created:
[gist https://gist.github.com/lankydan/d17aff3353380e3f3992f105e91b40b2 /]
Now, I'm not going to lie. This is a straight up copy from the Rocket documentation. That link will probably provide a better explanation than I would but I'll give you a quick run through it. init_pool
creates a new pool of connections for our database which we have specified as PgConnection
s. DbConn
wraps the actual PgConnection
. Finally, FromRequest
allows a DbConn
to be retrieved from Rocket handler functions when included in the input parameters, we will look at an example of this soon.
GET
, POST
, PUT
, DELETE
:
[gist https://gist.github.com/lankydan/c3e9fe301d87c270ba947ad9dbc3d130 /]
Each method is marked with an attribute that specifies what REST verb it accepts along with the path needed to get there. Part of the path is missing as the rest will be defined when the routes are created, so just hold on for a bit... The attributes can also accept a few extra properties to properly specify the behavior of the handler.
Until we look at routing, just assume the base path to these handler methods are localhost:8000/people
.
Let's look at one of the simpler handlers:
[gist https://gist.github.com/lankydan/c5e8a203cad89fad50d686c072a7a1b8 /]
This function returns all the person records stored in the database. It accepts a GET
request thanks to the #[get("/")]
attribute on the function. The path it accepts requests from is localhost:8000/people
as denoted by the "/"
.
To use cURL to send a request to this function we need to execute:
curl localhost:8000/people
This will then return a JSON list of people as specified by the return type of Result<Json<Vec<Person>>, Failure>
. To do this, records are retrieved from database and mapped into their JSON representation. Thanks to the return type of QueryResult<Vec<Person>>
from the all
function, if anything goes wrong at the database level we can then map this to a HTTP status code to represent the error properly. This is why the return type is a Result
. It provides us with an option to either return the records when nothing goes wrong but if anything does, it can return a error status code instead.
Serde finally pops up here, although it does so behind the scenes. Without the #[derive(Serialize)]
attribute we added onto the Person
struct earlier we would not be able to return Json<Vec<Person>>
from this function; the same applies for Json<Person>
.
The error_status
function isn't particularly interesting and doesn't help for this specific example. It simply converts an Error
contained within QueryResult
into a status code. I was only particularly interested in these two scenarios, hence why it either returns NotFound
or InternalServerError
for anything else since I'm lazy (plus most of the other errors would honestly be classed as internal server errors).
The last point to touch on before we look at another handler function, the appearance of DbConn
. The code we wrote earlier for connection pooling allows this. At this point all we need to do is include it in the function parameters and it will retrieve a connection for us.
Let's look at the PUT
handler next:
[gist https://gist.github.com/lankydan/344731e805bd7a9982648fe4f185c0cd /]
The first difference between this function and the previous ALL
example (ignoring request type) is the id
and person
being passed in. The "</id>"
represents the path variable id
and data = "<person">
represents that request body that maps to person
in the functions arguments. The format
property specifies the content of the request body, in other words, the data
property should contain JSON (indicated by application/json
). We can see that it does indeed do just that since person
is of type Json<Person>
.
Serde again shows up here. It is needed to retrieve the Json<Person>
from the request body.
To retrieve the contents of person
we must call into_inner()
, revealing the Person
that was waiting to break out all along... update
is called and the result or error is mapped and returned in the Result
enum. Due to the implementation of error_status
, an error will be thrown if an existing record does not exist with the passed in ID. Whether this is how it should work seems to vary from person to person (according to my googling anyway). If we instead wanted to insert the record if it did not already exist, we would need to handle the Error::NotFound
and instead call similar code to that in the POST
function.
Well we just mentioned it, so we need to look at it now. Below is the POST
function:
[gist https://gist.github.com/lankydan/4c873148524fa652d7b99f660d874f07 /]
This contains similar components to the PUT
function we just looked at. The main difference is the return type. The status code that should be returned from a successful POST
request is 201 Created
rather than 200 Ok
which was used by the previous functions that we looked at. To return a different status code, the Result
should contain status::Created
instead of Json<Person>
directly. This change is what makes it return a 201
status code.
To create the status::Created
struct, the created record along with the path to retrieve it (via a GET
request) must be passed into it's constructor. Passing in the path as an absolute string isn't ideal so I have retrieved the host and port number from the environment variables. This might not be the best way to get this to work... But I spent ages trying to figure out how to get them out of Rocket and gave up in the end.
We should probably also look at Responders in Rocket and how they enrich the returned responses, but this post is already so long so I will instead refer you to the Rocket documentation on the subject.
We are nearly at the end now... Don't give up yet!The handlers are setup to accept requests to the server but before we can use them the we need to set the routes to the different functions. Since all of the functions in this post are related to people it will be mapped to /people
. See the code below on how to do this:
[gist https://gist.github.com/lankydan/426ede1ffedafe52aa965f6a00ca9120 /]
create_routes
is called by the main
function to get everything rolling. ignite
creates a new instance of Rocket
. The handler functions are then mounted onto a base request path of /people
by specifying all of the them inside of routes!
. Finally, launch
starts the application server.
.env
file or create a Rocket.toml
file.
When using a .env
file, the values must follow the format of ROCKET_{PARAM}
where PARAM
is the property you are trying to set. {ADDRESS}
represents the host and {PORT}
is obviously the port number. Taking this information, below is the .env
file used in this post (removing unrelated configuration):
ROCKET_ADDRESS=localhost ROCKET_PORT=8000
If instead you wanted to use a Rocket.toml
file, it would look like the below.
[development] address = "localhost" port = 8000
In this situation, these values are only applicable for development, which is handy since thats all I'm doing.
If you choose to include neither of these Rocket will instead fall back to it's default configuration. So don't worry about needing to do loads of configuration when playing around with Rocket; for local development the defaults are most likely good enough.
For more (and better) explanations of Rocket Configuration, I again recommend looking at their documentation.
Finally we have reached the end. All that is left to do now is create themain
method so the application can be run.
[gist https://gist.github.com/lankydan/3efb9383579a23a1d40d502a2c450068 /]
All main
does is load in the environment variables and starts Rocket by calling create_routes
. The rest of this file just pulls in a load of crates so they don't need to be scattered throughout the rest of the code.
Now you can rest. That was a pretty long post. I'd write a conclusion but honestly, I'm tired and don't want to write anymore. So for a short summary, in this post we have created a simple REST API using Rocket to run an application server that responds to requests and used Diesel to connect to a database to manage the state of the application.
The code used in this post can be on my GitHub.
If you liked this post, then follow me on Twitter at @LankyDanDev to be able to keep up with my new posts as I write them.