Cribl Edge & Search Homelab Part 4 - Search
Now that Minio is internet accessible, let’s configure Cribl Search to read that data. In Search, there’s two concepts that are relevant to our configuration: Datasets and Dataset Providers. We will configure a Dataset Provider to access our Minio endpoint on the internet, and a Dataset to query our particular bucket in Minio.
- Go to Cribl Search
- From Edge or Search, select the product hamburger at the top and select
Search
- From the Cloud main navigation, click
Search
- From Edge or Search, select the product hamburger at the top and select
- Click
Data
- Click
Dataset Providers
at the left - Click the
New Dataset Provider
button - Under
Dataset Provider Type
, clickCribl Edge
then selectAmazon S3
- Click
Advanced Settings
to expand more options - Fill out the form like this:
- Add an
ID
- Set
Access Key
andSecret Key
to values you setup in Part 1 - Set
Endpoint
to thehttps://
URL you got in Part 3 - Click
Save
You now have a Dataset Provider
which is pointing to your Minio. Next, we need to configure a Dataset
which uses that provider.
- Click
Data
- Click
New Dataset
at the right - For
ID
enter something that matters to you, you’ll be using in search - Click
Provider
and select the provider you added before - For bucket, enter
data/${_time:%Y}/${_time:%m}/${_time:%d}/${_time:%H}/${host}
- If you named your bucket something other than
data
make sure to replace that in this string
- If you named your bucket something other than
- For region, select
US East 1
- Your form should look like this:
We now have a dataset which understands the partitioning structure we created in the S3 output. Now we should be able to search our data.
- Click
Search
next to your dataset
You should see a screen that looks like this:
There’s lots we can do to explore the data we’ve been collecting, but that’s for another post. Check out our documentation for more information on how to craft searches and get value out of your data.
Hope this was helpful and fun! More content coming as I work to explore the data being generated out of my home network.