diff --git a/README.md b/README.md
index 601ff98..bc9e0ce 100644
--- a/README.md
+++ b/README.md
@@ -1,141 +1,325 @@
 # Reverb
 
-## Overview
-
 Reverb is an efficient and easy-to-use data storage and transport system
-designed for machine learning research. Reverb is most commonly used as a
-prioritized experience replay system in distributed reinforcement learning
-algorithms, but the system also supports other data structure representations
-such as
-[FIFO](https://en.wikipedia.org/wiki/FIFO_\(computing_and_electronics\))
-and
-[priority queues](https://en.wikipedia.org/wiki/Priority_queue).
+designed for machine learning research. Reverb is primarily used as an
+experience replay system for distributed reinforcement learning algorithms, but
+the system also supports multiple data structure representations such as FIFO,
+LIFO, and priority queues.
+
+## Table of Contents
+
+- [Installation](#installation)
+- [Quick Start](#quick-start)
+- [Detailed Overview](#detailed-overview)
+  - [Tables](#tables)
+  - [Item Selection Strategies](#item-selection-strategies)
+  - [Rate Limiting](#rate-limiting)
+  - [Sharding](#sharding)
+  - [Checkpointing](#checkpointing)
+- [Citation](#citation)
 
 ## Installation
 
-### Install using pip
-TODO(b/155492840) Explain how to install with pip.
+The recommended way to install Reverb is with `pip`, but we also provide Docker
+images that can be used to build Reverb from source.
+
+### Using pip
+
+Note: Reverb expects TensorFlow >= 2.3.0 and thus requires the
+[tf-nightly](https://pypi.org/project/tf-nightly/) package until a stable 2.3
+[tensorflow](https://pypi.org/project/tensorflow/) release is available.
+
+```shell
+$ pip install tf-nightly
+$ pip install dm-reverb-nightly
+```
 
 ### Build from source
-TODO(b/155494968): Explain how to build from source.
- -## Key Benefits - -The key benefits of using Reverb (compared with built-in Python data structures -and other memory storage systems) are as follows: - -* Efficiency - * High performing C++ implementation - * Extremely fast sampling and update operations - * Memory-efficient storage -* Usability - * Clean Python API - * Support for both queues and replay tables - * Custom TensorFlow operations for in-graph sampling, inserting, and - updating -* Consistency - * Support for single-process and distributed settings - * Controlled throughput via rate limiters for reduced impact of external - conditions - -## The Data Model - -![](docs/animations/diagram2.svg) - -The above image demonstrates one of the key benefits of Reverb-- a -memory-efficient data model. Each timestep is only stored once on the server, -even if it is referenced by multiple items in a single table or by -multiple tables. Furthermore, the client only writes data to the server -when necessary. - -## Reverb in Practice - -![](docs/animations/diagram1.svg) - -This animation shows what the state of the server looks like at each step in the -code block. Although we are manually setting each item to have the same priority -value of 1.5, items do not need to have the same priority values (and, in -reality, will likely have differing and dynamically-calculated priority values). - - -For more examples of Reverb in practice, see [Code Examples](#code-examples). - -## Further Customizations - -Reverb can be customized according to the experiment's requirements. 
Common -customizations are as follows: - -* Creating multiple tables referencing the same underlying data -* Modifying `sampler` to control the strategy used to select samples -* Modifying `remover` to maintain a different set of items in full tables -* Using `RateLimiter` to maintain balance between actors and learners -* Limiting the number of times each item can be sampled using - `max_times_sampled` - -### Item Selectors - -Reverb defines several types of item selection strategies that can be used for -sampling or removing (when the table reaches maximum size) data from tables. -Below is a brief overview of the available selection strategies. - - * [Prioritized](https://arxiv.org/abs/1511.05952): sample such that the probability of sampling an item is - correlated to its specified priority value - * For more details about prioritized sampling and experience replay as - used in research, see - [Experience Replay in Research](#experience-replay-in-research) - * [MaxHeap](https://en.wikipedia.org/wiki/Heap_\(data_structure\)): sample - the item with the highest priority. 
If multiple items share the same - (highest) priority, select the most recently modified item - * [Uniform](https://en.wikipedia.org/wiki/Uniform_distribution): sample - from all items with equal probability, thus ignoring priority - * [Lifo](https://en.wikipedia.org/wiki/LIFO): sample the newest item - * [Fifo](https://en.wikipedia.org/wiki/FIFO_\(computing_and_electronics\)): - remove the oldest item - * [Lifo](https://en.wikipedia.org/wiki/LIFO): remove the newest item - * [MinHeap](https://en.wikipedia.org/wiki/Heap_\(data_structure\)): remove - the item with the minimum priority (or the least recently - inserted/updated if multiple items have this same priority) - * *Uniform*: remove from all items with equal probability - -### Rate Limiters - -`RateLimiters` are a powerful tool in making machine learning experiments more -robust by mitigating the impact of external factors on experiment results. -External factors could include noise such as changes in connection overhead or -hardware efficiency. By explicitly regulating the sampling rate, rate limiters -will ultimately lead to more robust and reproducible experiments. - -The two rate limiters available are: - -- `MinSize`: only allows sampling when the table contains a minimum number of - items -- `SampleToInsertRatio`: controls throughput so that the number of times each - item is sampled on average remains constant within the user-defined margin - of error - -Rate limiters block inserts and samples to maintain the desired ratio. This -behavior controls the rate that learners and actors are able to sample from and -insert into the replay buffer, regulating external factors such as the speed at -which the actor can take an environment step. 
-
-Consider the following scenario describing a SampleToInsertRatioLimiter with an
-error buffer of 3.0 and a sample-to-insert ratio of 2:
-
-![](docs/animations/diagram3.svg)
-
-One particular group of agent implementations, *single-threaded Python*, require
-extra attention as they are prone to deadlocks caused by the RateLimiter's
-blocking behavior. These risks do not exist in the distributed setting.
-
-## Experience Replay in Research
-
-Prioritized experience replay is one of the most common uses of Reverb. The
-following papers present the relevant state of the art research:
-
-* [Prioritized Experience Replay](https://arxiv.org/abs/1511.05952)
-* [Distributed Prioritized Experience Replay](https://arxiv.org/abs/1803.00933)
-* Queue-based experience replay ([IMPALA](https://arxiv.org/abs/1802.01561)
-  and [hybrid A3C](https://arxiv.org/abs/1611.06256))
-* [DQN](https://www.nature.com/articles/nature14236)
-
+Please see
+[this guide](pip_package/README.md#how-to-develop-and-build-reverb-with-the-docker-containers)
+for details on how to build Reverb from source.
+
+## Quick Start
+
+Starting a Reverb server is as simple as:
+
+```python
+import reverb
+
+server = reverb.Server(tables=[
+    reverb.Table(
+        name='my_table',
+        sampler=reverb.selectors.Uniform(),
+        remover=reverb.selectors.Fifo(),
+        max_size=100,
+        rate_limiter=reverb.rate_limiters.MinSize(1)),
+    ],
+    port=8000
+)
+```
+
+Create a client to communicate with the server:
+
+```python
+client = reverb.Client('localhost:8000')
+print(client.server_info())
+```
+
+Write some data to the table:
+
+```python
+# Creates a single item and data element [0, 1].
+client.insert([0, 1], priorities={'my_table': 1.0})
+```
+
+This creates an item with a reference to a single data element, `[0, 1]`. An
+item can also reference multiple data elements:
+
+```python
+# Creates three data elements (2, 3, and 4) and a single item `[2, 3, 4]` that
+# references all three of them.
+with client.writer(max_sequence_length=3) as writer:
+  writer.append(2)
+  writer.append(3)
+  writer.append(4)
+  writer.create_item('my_table', num_timesteps=3, priority=1.0)
+```
+
+The items we have added to Reverb can be read by sampling them:
+
+```python
+# client.sample() returns a generator.
+print(list(client.sample('my_table', num_samples=2)))
+```
+
+Continue with the
+[Reverb Tutorial](https://github.com/deepmind/reverb/tree/master/reverb/examples/demo.ipynb)
+for an interactive introduction.
+
+## Detailed Overview
+
+Experience replay has become an important tool for training off-policy
+reinforcement learning policies. It is used by algorithms such as
+[Deep Q-Networks (DQN)][DQN], [Soft Actor-Critic (SAC)][SAC],
+[Deep Deterministic Policy Gradients (DDPG)][DDPG], and
+[Hindsight Experience Replay][HER]. However, building an efficient,
+easy-to-use, and scalable replay system can be challenging. For performance,
+Reverb is implemented in C++, and to enable distributed usage it provides a
+gRPC service for adding, sampling, and updating the contents of the tables.
+Python clients expose the full functionality of the service in an easy-to-use
+fashion. Furthermore, native TensorFlow ops are available for performant
+integration with TensorFlow and `tf.data`.
+
+Although originally designed for off-policy reinforcement learning, Reverb's
+flexibility makes it just as useful for on-policy reinforcement learning -- or
+even (un)supervised learning. Creative users have even used Reverb to store and
+distribute frequently updated data (such as model weights), acting as a
+lightweight in-memory alternative to a distributed file system in which each
+table represents a file.
+
+### Tables
+
+A Reverb `Server` consists of one or more tables. A table holds items, and each
+item references one or more data elements. Tables also define sample and
+removal [selection strategies](#item-selection-strategies), a maximum item
+capacity, and a [rate limiter](#rate-limiting).
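To make the item/data-element relationship concrete, here is a small pure-Python sketch of the reference counting that keeps shared data alive while any item points at it. This is illustrative only -- `ElementStore` and its methods are our own names for this sketch, not the Reverb API:

```python
# Illustrative sketch only -- not the Reverb API. It mimics the rule that a
# data element lives for as long as at least one item references it, even
# when the referencing items live in different tables.
from collections import Counter


class ElementStore:
    def __init__(self):
        self.elements = {}          # element_id -> data
        self.refcounts = Counter()  # element_id -> number of referencing items

    def add_element(self, element_id, data):
        self.elements[element_id] = data

    def add_item(self, element_ids):
        # An item is just a set of references to existing data elements.
        for eid in element_ids:
            self.refcounts[eid] += 1

    def remove_item(self, element_ids):
        for eid in element_ids:
            self.refcounts[eid] -= 1
            if self.refcounts[eid] == 0:
                del self.elements[eid]  # no item references it any more


store = ElementStore()
store.add_element('t0', data=[0, 1])
store.add_item(['t0'])     # e.g. an item in a replay table
store.add_item(['t0'])     # e.g. an item in a queue, sharing the same element
store.remove_item(['t0'])
assert 't0' in store.elements      # still referenced by the second item
store.remove_item(['t0'])
assert 't0' not in store.elements  # last reference gone, element removed
```

The key point of the sketch: removing an item never copies or deletes data directly; it only drops references, and the data disappears once its reference count reaches zero.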
+
+Multiple items can reference the same data element, even if these items exist in
+different tables. This is because items only contain references to data elements
+(as opposed to a copy of the data itself). This also means that a data element
+is only removed when there exists no item that contains a reference to it.
+
+For example, it is possible to set up one Table as a Prioritized Experience
+Replay (PER) buffer for transitions (sequences of length 2), and another Table
+as a (FIFO) queue of sequences of length 3. In this case the PER data could be
+used to train DQN, and the FIFO data to train a transition model for the
+environment.
+
+![Using multiple tables](docs/images/multiple_tables_example.png)
+
+Items are automatically removed from the Table when one of two conditions is
+met:
+
+1. Inserting a new item would cause the number of items in the Table to exceed
+   its maximum capacity.
+
+1. An item has been sampled more than the maximum number of times permitted by
+   the Table's `max_times_sampled`. Note that not all tables set this limit.
+
+In both cases, which item to remove is determined by the table's removal
+strategy. As mentioned earlier, a data element is automatically removed from the
+`Server` when the number of items that reference it reaches zero.
+
+Users have full control over how data is sampled and removed from Reverb
+tables. The behavior is primarily controlled by the
+[item selection strategies](#item-selection-strategies) provided to the `Table`
+as the `sampler` and `remover`. In combination with the
+[`rate_limiter`](#rate-limiting) and `max_times_sampled`, a wide range of
+behaviors can be achieved. Some commonly used configurations include:
+
+**Uniform Experience Replay**
+
+A set of the `N=1000` most recently inserted items is maintained. By setting
+`sampler=reverb.selectors.Uniform()`, the probability of selecting an item is
+the same for all items.
+Due to `reverb.rate_limiters.MinSize(100)`, sampling requests will block until
+100 items have been inserted. By setting `remover=reverb.selectors.Fifo()`, the
+oldest item is removed first whenever an item has to be removed.
+
+```python
+reverb.Table(
+    name='my_uniform_experience_replay_buffer',
+    sampler=reverb.selectors.Uniform(),
+    remover=reverb.selectors.Fifo(),
+    max_size=1000,
+    rate_limiter=reverb.rate_limiters.MinSize(100),
+)
+```
+
+Examples of algorithms that make use of uniform experience replay include [SAC]
+and [DDPG].
+
+**Prioritized Experience Replay**
+
+A set of the `N=1000` most recently inserted items is maintained. By setting
+`sampler=reverb.selectors.Prioritized(priority_exponent=0.8)`, the probability
+of selecting an item is proportional to the item's priority.
+
+Note: See [Schaul, Tom, et al.][PER] for the algorithm used in this
+implementation of Prioritized Experience Replay.
+
+```python
+reverb.Table(
+    name='my_prioritized_experience_replay_buffer',
+    sampler=reverb.selectors.Prioritized(0.8),
+    remover=reverb.selectors.Fifo(),
+    max_size=1000,
+    rate_limiter=reverb.rate_limiters.MinSize(100),
+)
+```
+
+Examples of algorithms that make use of Prioritized Experience Replay are DQN
+(and its variants), and
+[Distributed Distributional Deterministic Policy Gradients][D4PG].
+
+**Queue**
+
+A collection of up to `N=1000` items where the oldest item is selected and
+removed in the same operation. If the collection contains 1000 items then insert
+calls are blocked until it is no longer full; if the collection is empty then
+sample calls are blocked until there is at least one item.
+
+```python
+reverb.Table(
+    name='my_queue',
+    sampler=reverb.selectors.Fifo(),
+    remover=reverb.selectors.Fifo(),
+    max_size=1000,
+    max_times_sampled=1,
+    rate_limiter=reverb.rate_limiters.Queue(size=1000),
+)
+
+# Or use the helper classmethod `.queue`.
+reverb.Table.queue(name='my_queue', max_size=1000)
+```
+
+Examples of algorithms that make use of Queues are
+[IMPALA](https://arxiv.org/abs/1802.01561) and asynchronous implementations of
+[Proximal Policy Optimization](https://arxiv.org/abs/1707.06347).
+
+### Item Selection Strategies
+
+Reverb defines several selectors that can be used for item sampling or removal:
+
+- **Uniform:** Samples uniformly among all items.
+- **Prioritized:** Samples proportionally to stored priorities.
+- **FIFO:** Selects the oldest data.
+- **LIFO:** Selects the newest data.
+- **MinHeap:** Selects data with the lowest priority.
+- **MaxHeap:** Selects data with the highest priority.
+
+Any of these strategies can be used for sampling or removing items from a
+Table. This gives users the flexibility to create customized Tables that best
+fit their needs.
+
+### Rate Limiting
+
+Rate limiters allow users to enforce conditions on when items can be inserted
+and/or sampled from a Table. Here is a list of the rate limiters that are
+currently available in Reverb:
+
+- **MinSize:** Sets a minimum number of items that must be in the Table before
+  anything can be sampled.
+- **SampleToInsertRatio:** Enforces a target average ratio of inserts to
+  samples by blocking insert and/or sample requests. This is useful for
+  controlling the number of times each item is sampled before being removed.
+- **Queue:** Items are sampled exactly once before being removed.
+- **Stack:** Items are sampled exactly once before being removed.
+
+### Sharding
+
+Reverb servers are unaware of each other, and when scaling a system up to a
+multi-server setup, data is not replicated across more than one node. This
+makes Reverb unsuitable as a traditional database but has the benefit of making
+it trivial to scale up systems where some level of data loss is acceptable.
+
+Distributed systems can be horizontally scaled by simply increasing the number
+of Reverb servers.
+When used in combination with a gRPC-compatible load balancer, the address of
+the load-balanced target can simply be provided to a Reverb client, and
+operations will automatically be distributed across the different nodes. You'll
+find details about the specific behaviors in the documentation of the relevant
+methods and classes.
+
+If a load balancer is not available in your setup, or if more control is
+required, systems can still be scaled in almost the same way. Simply increase
+the number of Reverb servers and create a separate client for each server.
+
+### Checkpointing
+
+Reverb supports checkpointing: the state and content of Reverb servers can be
+stored to permanent storage. While checkpointing, the `Server` serializes all
+of its data and the metadata needed to reconstruct it. During this process the
+`Server` blocks all incoming insert, sample, update, and delete requests.
+
+Checkpointing is done with a call from the Reverb `Client`:
+
+```python
+# client.checkpoint() returns the path the checkpoint was written to.
+checkpoint_path = client.checkpoint()
+```
+
+To restore a `reverb.Server` from a checkpoint:
+
+```python
+checkpointer = reverb.checkpointers.DefaultCheckpointer(path=checkpoint_path)
+# The arguments passed to `tables=` must be the same as those used by the
+# `Server` that wrote the checkpoint.
+server = reverb.Server(tables=[...], checkpointer=checkpointer)
+```
+
+Refer to
+[tfrecord_checkpointer.h](https://github.com/deepmind/reverb/tree/master/reverb/cc/platform/tfrecord_checkpointer.h)
+for details on the implementation of checkpointing in Reverb.
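The Prioritized selector described under Item Selection Strategies picks an item with probability proportional to `priority ** priority_exponent`. A minimal pure-Python sketch of that selection rule (illustrative only; the function name and structure below are ours, not part of Reverb's C++ implementation):

```python
# Illustrative sketch only -- not Reverb's implementation. Picks index i with
# probability proportional to priorities[i] ** priority_exponent.
import random


def prioritized_sample(priorities, priority_exponent=0.8):
    weights = [p ** priority_exponent for p in priorities]
    total = sum(weights)
    threshold = random.random() * total
    cumulative = 0.0
    for index, weight in enumerate(weights):
        cumulative += weight
        if threshold < cumulative:
            return index
    return len(weights) - 1  # guard against floating-point rounding


# With all of the weight on the last item, it is always the one selected.
assert prioritized_sample([0.0, 0.0, 5.0]) == 2
```

A `priority_exponent` of 0 flattens the distribution toward uniform sampling, while larger exponents concentrate probability mass on high-priority items.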
+ +## Citation + +If you use this code, please cite it as: + +``` +@misc{Reverb, + title = {{Reverb}: An efficient data storage and transport system for ML research}, + author = "{Albin Cassirer, Gabriel Barth-Maron, Manuel Kroiss, Eugene Brevdo}", + howpublished = {\url{https://github.com/deepmind/reverb}}, + url = "https://github.com/deepmind/reverb", + year = 2020, + note = "[Online; accessed 01-June-2020]" +} +``` + + + +[D4PG]: https://arxiv.org/abs/1804.08617 +[DDPG]: https://arxiv.org/abs/1509.02971 +[DQN]: https://www.nature.com/articles/nature14236 +[HER]: https://arxiv.org/abs/1707.01495 +[PER]: https://arxiv.org/abs/1511.05952 +[SAC]: https://arxiv.org/abs/1801.01290 diff --git a/docs/images/multiple_tables_example.png b/docs/images/multiple_tables_example.png new file mode 100644 index 0000000..e3b89a8 Binary files /dev/null and b/docs/images/multiple_tables_example.png differ