Example: Analytics on Enron Email
In this example, we will walk through how you can use our APIs to extract key features from thousands of emails in seconds.
Goals
We will be analyzing a dataset of emails from the Enron Corporation, which was made public during a compliance investigation. In this quickstart, we will use Trellis to set up the extraction and classification of all the emails.
1. Get your API key
First, go to https://dashboard.runtrellis.com/sign-up and create your account. Then click Settings on the lower right, or visit the settings page, and copy your API key.
2. Set your API key
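One way to set the key is to keep it out of your source code and read it from an environment variable. The sketch below assumes a variable named TRELLIS_API_KEY and assumes the key is sent unchanged in the Authorization header; both are illustrative choices, so check the API reference for the exact header format.

```python
import os


def load_api_key(env=None, var_name="TRELLIS_API_KEY"):
    """Read the API key from an environment mapping (os.environ by default).

    TRELLIS_API_KEY is a hypothetical variable name chosen for this example.
    """
    env = os.environ if env is None else env
    key = env.get(var_name, "").strip()
    if not key:
        raise RuntimeError(f"Set {var_name} to the API key copied from the dashboard")
    return key


def auth_headers(api_key):
    # Assumption: the key is passed as-is in the Authorization header.
    return {"Authorization": api_key, "Content-Type": "application/json"}
```

With the variable exported in your shell, `auth_headers(load_api_key())` gives you headers you can reuse for every request in the following steps.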
3. Create a project to hold your data
Each project has a unique ID. You can use this ID to refer to the project in the data upload process.
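As a minimal sketch, creating a project could look like the following. The base URL, the /projects/create path, the "name" field, and the data.proj_id response shape are assumptions made for illustration, not confirmed API details. The request is only built here, not sent, so you can inspect it first.

```python
import json
import urllib.request

# Assumption: illustrative base URL; confirm it in the API reference.
API_BASE = "https://api.runtrellis.com/v1"


def build_create_project_request(api_key, name):
    """Build (but do not send) a POST request that creates a project."""
    body = json.dumps({"name": name}).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_BASE}/projects/create",  # assumed path
        data=body,
        headers={"Authorization": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def extract_project_id(response_json):
    # Assumption: the response nests the ID as {"data": {"proj_id": "..."}}.
    return response_json["data"]["proj_id"]
```

To send the request, pass it to `urllib.request.urlopen`, decode the JSON body, and keep the project ID returned by `extract_project_id` for the upload step.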
4. Define the transformation you want
A transformation is a set of operations that runs on all the assets in your project and turns them into the format you want.
In this example, we want to extract who the email is from, the topic of the email based on our defined taxonomy, people mentioned, and whether this is a compliance risk.
More on how to define transform_params can be found in the Trellis documentation.
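To make the four fields above concrete, here is a sketch of what transform_params might contain. The key names (operations, column_name, column_type, transform_type, task_description, output_values) and the taxonomy values are assumptions for illustration; the documentation for transform_params is the authority on the real schema.

```python
# Hypothetical topic taxonomy; replace it with your own categories.
TOPIC_TAXONOMY = ["legal", "trading", "human_resources", "personal", "other"]


def build_transform_params(taxonomy):
    """Describe one output column per feature we want from each email."""
    return {
        "operations": [
            {
                "column_name": "email_from",
                "column_type": "text",
                "transform_type": "extraction",
                "task_description": "Extract who the email is from.",
            },
            {
                "column_name": "topic",
                "column_type": "text",
                "transform_type": "classification",
                "task_description": "Classify the topic of the email.",
                "output_values": list(taxonomy),
            },
            {
                "column_name": "people_mentioned",
                "column_type": "text[]",
                "transform_type": "extraction",
                "task_description": "List the people mentioned in the email.",
            },
            {
                "column_name": "compliance_risk",
                "column_type": "boolean",
                "transform_type": "classification",
                "task_description": "Decide whether the email is a compliance risk.",
            },
        ]
    }


def column_names(params):
    # Convenience helper for checking the definition before submitting it.
    return [op["column_name"] for op in params["operations"]]
```

Keeping the taxonomy in one list makes it easy to revise the categories without touching the rest of the definition.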
5. Create event triggers to automatically run the extraction and transformation
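The idea is to chain two triggers: when an asset is uploaded, run the extraction, and when an asset has been extracted, refresh the transformation. The sketch below builds such a payload. The event names, action types, and field names are assumptions for illustration, and transform_id stands for the ID returned when you created the transformation in step 4.

```python
def build_event_triggers(proj_id, transform_id):
    """Build a payload that chains upload -> extraction -> transformation."""
    return {
        "events_with_actions": [  # assumed top-level key
            {
                # Assumed event name: fires when a new asset lands in the project.
                "event_type": "asset_uploaded",
                "proj_id": proj_id,
                "actions": [{"type": "run_extraction", "proj_id": proj_id}],
            },
            {
                # Assumed event name: fires once an asset has been extracted.
                "event_type": "asset_extracted",
                "proj_id": proj_id,
                "actions": [
                    {"type": "refresh_transform", "transform_id": transform_id}
                ],
            },
        ]
    }
```

You would POST this payload to the event subscription endpoint with the same Authorization header used in the earlier steps.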
6. Upload the data to the project
Since we created the event triggers in the earlier step, the extraction and transformation will automatically run on the data we upload. If you set up the project and transformation in the dashboard, the event triggers should already be set up for you.
If you have local files instead of presigned URLs, you can use the create presigned URLs endpoint to generate presigned URLs for uploading to Trellis.
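As a rough sketch, the upload request body pairs the project ID with the list of file URLs. The field names proj_id and urls are assumptions for illustration. The helper below also rejects anything that is not an http(s) URL, so a malformed entry fails locally before any request is made.

```python
from urllib.parse import urlparse


def build_upload_payload(proj_id, urls):
    """Validate the URLs and build the (assumed) upload request body."""
    cleaned = []
    for url in urls:
        candidate = url.strip()
        parsed = urlparse(candidate)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError(f"Not a valid http(s) URL: {url!r}")
        cleaned.append(candidate)
    if not cleaned:
        raise ValueError("Provide at least one URL to upload")
    # Assumption: the endpoint accepts {"proj_id": ..., "urls": [...]}.
    return {"proj_id": proj_id, "urls": cleaned}
```

For example, `build_upload_payload(proj_id, ["https://example.com/email_1.txt"])` returns a body you can POST to the upload endpoint; example.com is a placeholder host.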