Cribl Edge & Search Homelab Part 4 - Search
Now that Minio is internet accessible, let’s configure Cribl Search to read that data. In Search, there’s two concepts that are relevant to our configuration: Datasets and Dataset Providers. We will configure a Dataset Provider to access our Minio endpoint on the internet, and a Dataset to query our particular bucket in Minio.
- Go to Cribl Search
- From Edge or Search, select the product hamburger at the top and select
Search - From the Cloud main navigation, click
Search
- From Edge or Search, select the product hamburger at the top and select
- Click
Data - Click
Dataset Providersat the left - Click the
New Dataset Providerbutton - Under
Dataset Provider Type, clickCribl Edgethen selectAmazon S3 - Click
Advanced Settingsto expand more options - Fill out the form like this:

- Add an
ID - Set
Access KeyandSecret Keyto values you setup in Part 1 - Set
Endpointto thehttps://URL you got in Part 3 - Click
Save
You now have a Dataset Provider which is pointing to your Minio. Next, we need to configure a Dataset which uses that provider.
- Click
Data - Click
New Datasetat the right - For
IDenter something that matters to you, you’ll be using in search - Click
Providerand select the provider you added before - For bucket, enter
data/${_time:%Y}/${_time:%m}/${_time:%d}/${_time:%H}/${host}- If you named your bucket something other than
datamake sure to replace that in this string
- If you named your bucket something other than
- For region, select
US East 1 - Your form should look like this:

We now have a dataset which understands the partitioning structure we created in the S3 output. Now we should be able to search our data.
- Click
Searchnext to your dataset
You should see a screen that looks like this:

There’s lots we can do to explore the data we’ve been collecting, but that’s for another post. Check out our documentation for more information on how to craft searches and get value out of your data.
Hope this was helpful and fun! More content coming as I work to explore the data being generated out of my home network.