docs: MVP plotly-express docs #554

alexpeters1208 · 2024-06-14T14:52:11Z

Minimum required docs for the plotly-express plugin. Here are the outstanding items:

Fill out "other".
Document ecdf once it is implemented.

Directions for testing:

As of 7/17, everything needed for testing is baked into a release. Here's a simple testing environment using pip-installed DH.

# make new dir for testing
mkdir test-dx && cd test-dx

# create env for installs
python -m venv test-dx-venv
source test-dx-venv/bin/activate

# install some necessary things for the build
pip install --upgrade pip setuptools

# install the server, need 35.1 or 34.3
pip install deephaven-server==0.35.1

# install the plugin
pip install deephaven-plugin-plotly-express

# I need to do this to get `which deephaven` to give the correct venv version, you may not
deactivate
source test-dx-venv/bin/activate

# start the server
deephaven server

dsmmcken

nothing huge jumps out on first read. I haven't tried running every block of code. @jnumainville should look through with sharper eyes on the code blocks.

plugins/plotly-express/docs/timeline.md

plugins/plotly-express/docs/area.md

plugins/plotly-express/docs/README.md

plugins/plotly-express/docs/box.md

plugins/plotly-express/docs/histogram.md

plugins/plotly-express/docs/line-3d.md

plugins/plotly-express/docs/multiple-axes.md

plugins/plotly-express/docs/strip.md

plugins/plotly-express/docs/area.md

plugins/plotly-express/docs/candlestick.md

plugins/plotly-express/docs/histogram.md

plugins/plotly-express/docs/line.md

plugins/plotly-express/docs/scatter.md

plugins/plotly-express/docs/bar.md

plugins/plotly-express/docs/box.md

plugins/plotly-express/docs/candlestick.md

plugins/plotly-express/docs/sub-plots.md

plugins/plotly-express/docs/sunburst.md

chipkent · 2024-07-23T20:23:55Z

plugins/plotly-express/docs/timeline.md

+jobs = dx.data.jobs() # import the ticking jobs dataset
+
+# the `by` argument is used to color the bars by another categorical variable
+jobs_resource_tracking = dx.timeline(jobs, x_start="StartTime", x_end="EndTime", y="Job")


This example is identical to the prior example and is not doing what it says it does.

I didn't notice this initially. Getting the example "right" gives some pretty bad results. Putting the code in but leaving this comment open.

plugins/plotly-express/docs/treemap.md

plugins/plotly-express/docs/line.md

# Conflicts: # plugins/plotly-express/docs/scatter.md # plugins/plotly-express/docs/sub-plots.md

plugins/plotly-express/docs/scatter.md

dsmmcken

Sorry - I hadn't mentioned this previously but one of the reasons I pushed for all our example data sets to be deterministic was for testing. This one example is not.

plugins/plotly-express/docs/density_heatmap.md

plugins/plotly-express/docs/box.md

plugins/plotly-express/docs/histogram.md

plugins/plotly-express/docs/area.md

plugins/plotly-express/docs/violin.md

chipkent · 2024-07-26T19:28:40Z

plugins/plotly-express/docs/multiple-axes.md

+cat_dog = stocks.where("sym in `CAT`, `DOG`")
+
+# use `by` to specify the grouping column and order axes left to right with yaxis_sequence
+line_plot_by = dx.line(cat_dog, x="timestamp", y="price", by="sym", yaxis_sequence=[1, 2])


This needs a ticket to assess the library. It is not a problem with your example.

plugins/plotly-express/docs/scatter.md

chipkent · 2024-07-26T19:39:38Z

plugins/plotly-express/docs/multiple-axes.md

+
+### Multiple columns
+
+When two or more response variables appear in separate columns, passing multiple column names to `x` or `y` is the recommended way to create multiple axes.


Take this example:

import deephaven.plot.express as dx gapminder = dx.data.gapminder() # import a ticking version of the Gapminder dataset # get a specific country brazil = gapminder.where("country == `Brazil`") # specify multiple y-axis columns and order axes left to right with yaxis_sequence line_plot_multi = dx.line(brazil, x="year", y=["pop", "gdpPercap"], yaxis_sequence=[1, 2]) line_plot_multi_2 = dx.line(brazil, x="year", y=["pop", "gdpPercap"])

Passing multiple value to y is NOT how the multiple axes are created. That creates two lines. To create separate axes for the lines, you need to specify yaxis_sequence. This interaction is not clear in the prose or example. The example would be better with two plots like I have. Show that providing y=[a,b] gives two lines, and then adding yaxis_sequence puts them on different axes. As it is, the prose is incorrect and doesn't show them how to do these two common cases.

plugins/plotly-express/docs/plot-by.md

plugins/plotly-express/docs/multiple-axes.md

jnumainville · 2024-07-26T21:47:01Z

plugins/plotly-express/docs/plot-by.md

@@ -1,5 +1,146 @@
 # Plot By

-To plot multiple series from a table into a single chart, use the `by` parameter. This parameter accepts a column name or a list of column names. The chart will be partitioned by the values in the specified column(s), with one series for each unique value. Other parameters, such as `color` (for which `by` is an alias), `symbol`, `size`, `width`, and `line_dash` can also be used to partition the chart.
+To plot multiple series from a table into a single chart, use the `by` parameter. This parameter accepts a column name or a list of column names denoting other variables of interest in the dataset. The chart will be partitioned by the values in the specified column(s), with one series for each unique value. Other parameters, such as `color` (for which `by` is an alias), `symbol`, `size`, `width`, and `line_dash` can also be used to partition the chart.


it should be clarified that by is not simply an alias for color
it can be tweaked by using by_vars and passing in these other columns such as symbol and size
it also behaves slightly differently though, take this example

import deephaven.plot.express as dx tips = dx.data.tips() # import the example iris data set by_list = dx.scatter(tips, x="TotalBill", y="Tip", by=["Time", "Smoker"], by_vars=["color", "symbol"]) by_prod = dx.scatter(tips, x="TotalBill", y="Tip", by="Time", symbol="Smoker")

by_list just loops through color/symbol combos (jointly)
whereas by_prod assigns colors to specific column values, so, of the four joint values, Lunch and Dinner have the same color and Yes and No have the same symbol.

The first method is more useful to emphasize differences, whereas the second is more useful to emphasize similarities.

I removed the "for which by" is an alias" part - hopefully that cleans up the confusion. As far as using by_vars, that seems like something that should belong in the expanded version of this doc once the full thing is written, and not necessarily in the introduction. What do you think?

Yeah, shouldn't be in the intro, that's fine

plugins/plotly-express/docs/plot-by.md

chipkent · 2024-07-27T18:29:30Z

plugins/plotly-express/docs/plot-by.md


-Under the hood, the Deephaven query engine performs a `parition_by` table operation on the given color column to create each series. This efficient implementation means that plots with multiple groups can easily scale to tens of millions or billions of rows with ease.
+Under the hood, the Deephaven query engine performs a `partition_by` table operation on the given grouping column to create each series. This efficient implementation means that plots with multiple groups can easily scale to tens of millions or billions of rows with ease.


Ultimately, partition_by should be linked, but I don't know that @dsmmcken is far enough down the new impl to worry about this yet.

plugins/plotly-express/docs/sidebar.json

alexpeters1208 added 3 commits June 11, 2024 16:20

Docs

d124531

Continue docs

af94f9f

More docs

6907460

alexpeters1208 requested review from dsmmcken and jnumainville June 14, 2024 14:52

alexpeters1208 self-assigned this Jun 14, 2024

alexpeters1208 marked this pull request as draft June 14, 2024 14:52

alexpeters1208 and others added 9 commits June 18, 2024 17:20

Add polar and ternary examples

8b56f55

Add multiple-axes

2fbf7b2

Start re-wording

03c067b

Fix spacing

275aa7d

Start "what are they useful for"

50ee789

Small language changes

566fadc

When are they appropriate

6d15404

Update ecdf

ad907ba

Merge branch 'main' into dx-min-docs

1e21a4d

alexpeters1208 marked this pull request as ready for review June 21, 2024 18:15

alexpeters1208 added 2 commits June 21, 2024 13:29

Update notes and warnings

b615e3e

Simplify "what are they useful for"

1ad4acd

alexpeters1208 requested review from chipkent and margaretkennedy June 21, 2024 19:00

dsmmcken requested changes Jun 24, 2024

View reviewed changes

plugins/plotly-express/docs/timeline.md Outdated Show resolved Hide resolved

plugins/plotly-express/docs/timeline.md Outdated Show resolved Hide resolved

plugins/plotly-express/docs/area.md Outdated Show resolved Hide resolved

Don Area suggestion

d872a90

margaretkennedy reviewed Jun 24, 2024

View reviewed changes

plugins/plotly-express/docs/README.md Outdated Show resolved Hide resolved

margaretkennedy reviewed Jun 25, 2024

View reviewed changes

plugins/plotly-express/docs/box.md Outdated Show resolved Hide resolved

jnumainville requested changes Jun 26, 2024

View reviewed changes

margaretkennedy reviewed Jun 27, 2024

View reviewed changes

plugins/plotly-express/docs/candlestick.md Outdated Show resolved Hide resolved

plugins/plotly-express/docs/candlestick.md Outdated Show resolved Hide resolved

plugins/plotly-express/docs/histogram.md Outdated Show resolved Hide resolved

alexpeters1208 added 3 commits June 28, 2024 09:54

More review suggestions

d39b2b7

Funnel plot

643003a

Funnel, funnel area, timeline

6eb71fb

More plot by

d6638b6

chipkent reviewed Jul 23, 2024

View reviewed changes

alexpeters1208 added 5 commits July 24, 2024 16:11

First round of revisions from Chip

86c1b73

Scatter progress

4e4cc91

Merge branch 'main' into dx-min-docs

6ef84bc

# Conflicts: # plugins/plotly-express/docs/scatter.md # plugins/plotly-express/docs/sub-plots.md

Revise scatter

b4a186f

More polish

ab8f2c4

dsmmcken reviewed Jul 25, 2024

View reviewed changes

alexpeters1208 added 3 commits July 25, 2024 17:00

More polish, Don review, add density heatmap

778246e

Pascal case

b5665a5

Links

a562b64

dsmmcken requested changes Jul 26, 2024

View reviewed changes

plugins/plotly-express/docs/density_heatmap.md Outdated Show resolved Hide resolved

alexpeters1208 added 2 commits July 26, 2024 12:29

Revise concept pieces

a432741

Deterministic large datasets

c408bf2

This was referenced Jul 26, 2024

Category order is data-dependent when using by #674

Open

Strip plot does not actually render data points by default #548

Open

chipkent reviewed Jul 26, 2024

View reviewed changes

Chip review

f3c7ebe

chipkent reviewed Jul 26, 2024

View reviewed changes

plugins/plotly-express/docs/plot-by.md Outdated Show resolved Hide resolved

plugins/plotly-express/docs/multiple-axes.md Outdated Show resolved Hide resolved

jnumainville requested changes Jul 26, 2024

View reviewed changes

Chip and Joe suggestions

83fcb87

chipkent previously approved these changes Jul 27, 2024

View reviewed changes

dsmmcken reviewed Jul 29, 2024

View reviewed changes

plugins/plotly-express/docs/sidebar.json Outdated Show resolved Hide resolved

Move density heatmap up

6dc0ed6

alexpeters1208 dismissed chipkent’s stale review via 6dc0ed6 July 29, 2024 15:27

dsmmcken approved these changes Jul 29, 2024

View reviewed changes

jnumainville approved these changes Jul 29, 2024

View reviewed changes

alexpeters1208 merged commit 4c556d3 into main Jul 29, 2024
14 checks passed

alexpeters1208 deleted the dx-min-docs branch July 29, 2024 15:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: MVP plotly-express docs #554

docs: MVP plotly-express docs #554

alexpeters1208 commented Jun 14, 2024 •

edited

Loading

dsmmcken left a comment

chipkent Jul 23, 2024

alexpeters1208 Jul 24, 2024

alexpeters1208 Jul 24, 2024

dsmmcken left a comment

chipkent Jul 26, 2024

chipkent Jul 26, 2024

jnumainville Jul 26, 2024

alexpeters1208 Jul 26, 2024

jnumainville Jul 26, 2024

chipkent Jul 27, 2024


		### Multiple columns

		When two or more response variables appear in separate columns, passing multiple column names to `x` or `y` is the recommended way to create multiple axes.


		Under the hood, the Deephaven query engine performs a `parition_by` table operation on the given color column to create each series. This efficient implementation means that plots with multiple groups can easily scale to tens of millions or billions of rows with ease.
		Under the hood, the Deephaven query engine performs a `partition_by` table operation on the given grouping column to create each series. This efficient implementation means that plots with multiple groups can easily scale to tens of millions or billions of rows with ease.

docs: MVP plotly-express docs #554

docs: MVP plotly-express docs #554

Conversation

alexpeters1208 commented Jun 14, 2024 • edited Loading

dsmmcken left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dsmmcken left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alexpeters1208 commented Jun 14, 2024 •

edited

Loading