Puid (puid v2.1.0)

Simple, fast, flexible and efficient generation of probably unique identifiers (puid, aka random strings) of intuitively specified entropy using pre-defined or custom characters.

overview

Overview

Puid provides fast and efficient generation of random IDs. For the purposes of Puid, a random ID is considered a random string used in a context of uniqueness, that is, random IDs are a bunch of random strings that are hopefully unique.

Random string generation can be thought of as a transformation of some random source of entropy into a string representation of randomness. A general purpose random string library used for random IDs should therefore provide user specification for each of the following three key aspects:

entropy-source

Entropy source

What source of randomness is being transformed? Puid allows easy specification of the function used for source randomness.

id-characters

ID characters

What characters are used in the ID? Puid provides 16 pre-defined character sets, as well as allows custom character designation, including Unicode

id-randomness

ID randomness

What is the resulting “randomness” of the IDs? Note this isn't necessarily the same as the randomness of the entropy source. Puid allows explicit specification of ID randomness in an intuitive manner.

examples

Examples

Creating a random ID generator using Puid is a simple as:

iex> defmodule(RandId, do: use(Puid))
iex> RandId.generate()
"8nGA2UaIfaawX-Og61go5A"

Options allow easy and complete control of ID generation.

entropy-source-1

Entropy Source

Puid uses :crypto.strong_rand_bytes/1 as the default entropy source. The rand_bytes option can be used to specify any function of the form (non_neg_integer) -> binary as the source:

iex > defmodule(PrngPuid, do: use(Puid, rand_bytes: &:rand.bytes/1))
iex> PrngPuid.generate()
"bIkrSeU6Yr8_1WHGvO0H3M"

id-characters-1

ID Characters

By default, Puid use the RFC 4648 file system & URL safe characters. The chars option can by used to specify any of 16 pre-defined character sets or custom characters, including Unicode:

iex> defmodule(HexPuid, do: use(Puid, chars: :hex))
iex> HexPuid.generate()
"13fb81e35cb89e5daa5649802ad4bbbd"

iex> defmodule(DingoskyPuid, do: use(Puid, chars: "dingosky"))
iex> DingoskyPuid.generate()
"yiidgidnygkgydkodggysonydodndsnkgksgonisnko"

iex> defmodule(DingoskyUnicodePuid, do: use(Puid, chars: "dîñgø$kyDÎÑGØßK¥", total: 2.5e6, risk: 1.0e15))
iex> DingoskyUnicodePuid.generate()
"øßK$ggKñø$dyGîñdyØøØÎîk"

id-randomness-1

ID Randomness

Generated IDs have 128-bit entropy by default. Puid provides a simple, intuitive way to specify ID randomness by declaring a total number of possible IDs with a specified risk of a repeat in that many IDs:

To generate up to 10 million random IDs with 1 in a trillion chance of repeat:

iex> defmodule(MyPuid, do: use(Puid, total: 10.0e6, risk: 1.0e15))
iex> MyPuid.generate()
"T0bFZadxBYVKs5lA"

The bits option can be used to directly specify an amount of ID randomness:

iex> defmodule(Token, do: use(Puid, bits: 256, chars: :hex_upper))
iex> Token.generate()
"6E908C2A1AA7BF101E7041338D43B87266AFA73734F423B6C3C3A17599F40F2A"

module-api

Module API

Puid modules have two functions:

iex> defmodule(AlphanumId, do: use(Puid, total: 10.0e06, risk: 1.0e15, chars: :alphanum))

generate/0

Generates a puid

iex> AlphanumId.generate()
"UKQHTmvASwyhcwGNA"

info/0

Returns a Puid.Info structure consisting of

  • Name of pre-defined Puid.Chars or :custom
  • Source characters
  • Entropy bits
    • May be larger than the specified bits since it is a multiple of the entropy bits per character
  • Entropy bits per character
  • Entropy representation efficiency
    • Ratio of the puid entropy to the bits required for string representation
  • puid string length
  • Entropy source function
iex> AlphanumId.info()
 %Puid.Info{
   char_set: :alphanum,
   characters: "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789",
   entropy_bits: 101.22,
   entropy_bits_per_char: 5.95,
   ere: 0.74,
   length: 17,
   rand_bytes: &:crypto.strong_rand_bytes/1
 }