Entrepot (Bonfire v1.0.0-social-rc.3.22)
View SourceMinimal, composable file upload, storage, and streamed data migrations for Elixir apps, flexibly and with minimal dependencies.
:warning: Although it's been used in production for over a year without issue, Entrepôt is experimental and still in active development. Accepting file uploads introduces specific security vulnerabilities. Use at your own risk.
Concepts
Entrepôt intentionally strips file storage logic down to its most composable parts and lets you decide how you want to use them. These components are: storage, upload, locator, and optionally, uploader, which provides a more ergonomic API for the other 3.
It is intentionally agnostic about versions, transformation, validations, etc. Most of the convenience offered by other libraries around these features comes at the cost of locking in dependence on specific tools and hiding complexity. Entrepôt puts a premium on simplicity and explicitness.
So what does it do? Here's a theoretical example of a use case with an Ecto<sup>1</sup> schema, which stores the file retrieved from a URL, along with some additional metadata:
  def create_attachment(upload, user) do
    Multi.new()
    |> Multi.run(:upload, fn _, _ ->
      YourStorage.put(upload, prefix: :crypto.hash(:md5, [user.id, url]) |> Base.encode16())
    end)
    |> Multi.insert(:attachment, fn %{upload: file_id} ->
      %Attachment{file_data: Locator.new!(id: file_id, storage: YourStorage, metadata: %{type: "document"})
    end)
    |> Repo.transaction()
  endThen to access the file:
%Attachment{file_data: file} = attachment
{:ok, contents} = Disk.read(file.id)Storage
A "storage" is a behaviour that implements the following "file-like" callbacks:
- read
- put
- delete
Implementing your own storage is as easy as creating a module that quacks this way. Each callback should accept an optional list of options as the last arg. Which options are supported is up to the module that implements the callbacks.
Upload
Upload is a protocol consisting of the following two functions:
- contents
- name
A storage uses this interface to figure how to extract the file data from a given struct and how to identify it. See Entrepot.Locator for an example of how this protocol can be implemented.
Locator
Locators are the mediators between storages and uploads. They represent where an uploaded file was stored so it can be retrieved. They contain a unique id, the name of the storage to which the file was uploaded, and a map of user defined metadata.
Locator also implements the upload protocol, which means moving a file from one storage to another is straightforward, and very useful for "promoting" a file from temporary (e.g. Disk) to permanent (e.g. S3) storage<sup>2</sup>:
old_file_data = %Locator{id: "/path/to/file.jpg", storage: Disk, metadata: %{}}
{:ok, new_id} = S3.put(old_file_data)`Note: always remember to take care of cleaning up the old file as Entrepot never automatically removes files:
Disk.delete(old_file_data.id)
Uploader
This helper was added in order to support DRYing up storage access. In most apps, there are certain types of assets that will be uploaded and handled in a similar, if not the same way, if only when it comes to where they are stored. You can use the uploader to codify the handling for specific types of assets.
defmodule AvatarUploader do
  use Entrepot.Uploader, storages: [cache: Disk, store: S3]
  def build_options(upload, :cache, opts) do
    Keyword.put(opts, :prefix, "cache/#{Date.utc_today()}")
  end
  def build_options(upload, :store, opts) do
    opts
    |> Keyword.put(:prefix, "users/#{opts[:user_id]}/avatar")
    |> Keyword.drop([:user_id])
  end
  def build_metadata(upload, :store, _), do: [uploaded_at: DateTime.utc_now()]
endThen you can get the files where they need to be without constructing all the options everywhere they might be uploaded: AvatarUploader.store(upload, :store, user_id: 1)
Note: as this example demonstrates, the function can receive arbitrary data and use it to customize how it builds the storage options before they are passed on.
Built-in Integrations
Entrepôt's module design is intended to make it easy to implement your own custom utilities for handling files in the way you need. However, anticipating the most common use cases, that is facilitated with the following optional modules and add-on library.
There are several implementations some common file storages (including S3/Digital Ocean) and uploads (including Plug.Upload).
Storages
Entrepôt ships with the following storage implementations:
Disk
This saves uploaded files to a local disk. It is useful for caching uploads while you validate other data, and/or perform some file processing.
configuration
- To set the root directory where files will be stored: Application.put_env(:entrepot, Entrepot.Storages.Disk, root_dir: "tmp")
options
- prefix: This should be a valid system path that will be appended to the root. If it does not exist, Disk will create it.
- force: If this option is set to a truthy value, Disk will overwrite any existing file at the derived path. Use with caution!
notes
Since it is possible for files with the same name to be uploaded multiple times, Disk needs some additional info to uniquely identify the file. Disk does not overwrite files with the same name by default. To ensure an upload can be stored, the combination of the Upload.name and prefix should be unique.
S3
This storage uploads files to AWS's S3 service. It also works with Digital Ocean Spaces.
configuration
- To set the bucket where files will be stored: Application.put_env(:entrepot, Entrepot.Storages.S3, bucket: "whatever")
options
- prefix: A string to prepend to the upload's key
- s3_options: Keyword list of option that will passed directly to ex_aws_s3
dependencies
Some of the implementations might require further dependencies (currently only S3-compatible storage) that you will also need to add to your project's deps
{:ex_aws, "~> 2.0"}
{:ex_aws_s3, "~> 2.0"}RAM
Uses Elixir's StringIO module to store file contents in memory. Since the "files" are essentially just strings, they will not be persisted and will error if they are read back from a database, for example. However, operations are correspondingly very fast and thus suitable for tests or other temporary file operations.
uploads
There are implementation of the Entrepot.Upload protocol for the following modules:
URI
This is useful for transferring files already hosted elsewhere, for example in cloud storage not controlled by your application, or a TUS server.
You can use it to allow users to post a url string in lieu of downloading and reuploading a file. A Phoenix controller action implementing this feature might look like this:
def attach(conn, %{"attachment" => %{"url" => url}}) when url != "" do
  URI.parse(url)
  |> Disk.put(upload)
  # ...redirect, etc
endnotes
This implementation imposes a hard timeout limit of 15 seconds to download the file from the remote location.
Plug.Upload
This supports multi-part form submissions handled by Plug.
EntrepôtEcto
There is an external library (because it needs Ecto as a dependency) which provides Entrepot.Ecto.Type for Ecto schema fields to easily handle persisting Locator data in your repository.
Note: Entrepôt was originally forked from Capsule
Summary
Functions
Adds metadata to a Locator.
Copies a file from one storage to another.
Resolves the storage module from a Locator.
Functions
Adds metadata to a Locator.
Parameters
- locator: A- Locatorstruct to which metadata will be added.
- key: A key for the metadata (when adding a single key-value pair).
- val: A value for the metadata (when adding a single key-value pair).
- data: A map or keyword list of metadata to be added.
Returns
- {:ok, Locator.t()}: An updated- Locatorstruct with the new metadata.
- {:error, term()}: The original error tuple if given an error tuple.
Examples
iex> Entrepot.add_metadata(%Locator{}, :key, "value")
{:ok, %Locator{metadata: %{key: "value"}}}
iex> Entrepot.add_metadata(%Locator{key: "value"}, %{key2: "value2"})
{:ok, %Locator{metadata: %{key: "value1", key2: "value2"}}}Copies a file from one storage to another.
Parameters
- locator: A- Locatorstruct representing the file to be copied.
- dest_storage: The destination storage module.
- opts: Optional keyword list of options to be passed to the storage modules.
Returns
- {:ok, Locator.t()}: A new- Locatorstruct for the copied file.
- {:error, term()}: An error tuple if the copy operation fails.
Raises
- Raises an error if attempting to copy a file to the same storage.
Examples
iex> Entrepot.copy(%Locator{id: "file.txt", storage: Disk}, S3)
{:ok, %Locator{id: "new_id", storage: S3, metadata: %{copied_from: Disk}}}Resolves the storage module from a Locator.
Parameters
- locator: A- Locatorstruct containing the storage information.
Returns
- The resolved storage module as an atom.
Raises
- InvalidStorage: If the storage module cannot be resolved.
Examples
iex> Entrepot.storage!(%Locator{storage: Disk})
Disk
iex> Entrepot.storage!(%Locator{storage: "Elixir.Disk"})
Disk