Quickstart

In [1]:

Copied!

import genstudio.plot as Plot
import genstudio.plot as Plot

Let's start with a simple line plot. Given a dataset of six [x, y] coordinates,

In [2]:

Copied!

six_points = [[1, 1], [2, 4], [1.5, 7], [3, 10], [2, 13], [4, 15]]
six_points = [[1, 1], [2, 4], [1.5, 7], [3, 10], [2, 13], [4, 15]]

Here is a line plot:

In [3]:

Copied!

Plot.line(six_points)
Plot.line(six_points)

Out[3]:

Plotting with Marks¶

In GenStudio (and Observable Plot), marks are the basic visual elements used to represent data. The line we just used is one type of mark. Other common marks include dot for scatter plots, bar for bar charts, and text for adding labels.

Each mark type has its own set of properties that control its appearance and behavior. For example, with line, we can control the stroke, stroke width, and curve:

In [4]:

Copied!





Plot.line(
    six_points,
    {
        "stroke": "steelblue",  # Set the line color
        "strokeWidth": 3,  # Set the line thickness
        "curve": "natural",  # Set the curve type
    },
)
Plot.line(
    six_points,
    {
        "stroke": "steelblue",  # Set the line color
        "strokeWidth": 3,  # Set the line thickness
        "curve": "natural",  # Set the curve type
    },
)

Out[4]:

To learn more, refer to the Observable Plot documentation.

Layering Marks & Options¶

We can layer multiple marks and add options to plots using the + operator. For example, here we compose a line mark with a dot mark, then add a frame:

In [5]:

Copied!





(
    Plot.line(six_points, {"stroke": "pink", "strokeWidth": 10})
    + Plot.dot(six_points, {"fill": "purple"})
    + Plot.frame()
)
(
    Plot.line(six_points, {"stroke": "pink", "strokeWidth": 10})
    + Plot.dot(six_points, {"fill": "purple"})
    + Plot.frame()
)

Out[5]:

For more advanced layout options, including grids and responsive layouts, see the Layouts guide.

Data & Channels¶

Channels are how we map our data to visual properties of the mark. For many marks, x and y are the primary channels, but others like color, size, or opacity are also common. We typically specify our data and channels separately.

Say we have a list of objects:

In [6]:

Copied!





object_data = [
    {"X": 1, "Y": 2, "CATEGORY": "A"},
    {"X": 2, "Y": 4, "CATEGORY": "B"},
    {"X": 1.5, "Y": 7, "CATEGORY": "C"},
    {"X": 3, "Y": 10, "CATEGORY": "D"},
    {"X": 2, "Y": 13, "CATEGORY": "E"},
    {"X": 4, "Y": 15, "CATEGORY": "F"},
]
object_data = [
    {"X": 1, "Y": 2, "CATEGORY": "A"},
    {"X": 2, "Y": 4, "CATEGORY": "B"},
    {"X": 1.5, "Y": 7, "CATEGORY": "C"},
    {"X": 3, "Y": 10, "CATEGORY": "D"},
    {"X": 2, "Y": 13, "CATEGORY": "E"},
    {"X": 4, "Y": 15, "CATEGORY": "F"},
]

A mark takes data followed by an options dictionary, which specifies how channel names get their values.

There are several ways to specify channel values in Observable Plot:

A string is used to specify a property name in the data object. If it matches, that property's value is used. Otherwise, it's treated as a literal value.
A function will receive two arguments, (data, index), and should return the desired value for the channel. We use Plot.js to insert a JavaScript source string - this function is evaluated within the rendering environment, and not in python.
An array provides explicit values for each data point. It should have the same length as the list passed in the first (data) position.
Other values will be used as a constant for all data points.

In [7]:

Copied!





Plot.dot(
    object_data,
    {
        "x": "X",
        "y": "Y",
        "stroke": Plot.js("(data, index) => data.CATEGORY"),
        "strokeWidth": [1, 2, 3, 4, 5, 6],
        "r": 8,
        "fill": None,
    },
)
Plot.dot(
    object_data,
    {
        "x": "X",
        "y": "Y",
        "stroke": Plot.js("(data, index) => data.CATEGORY"),
        "strokeWidth": [1, 2, 3, 4, 5, 6],
        "r": 8,
        "fill": None,
    },
)

Out[7]:

Data Serialization¶

Data is passed to the JavaScript runtime as JSON with binary buffer support. The serialization process handles various data types:

Data Type	Conversion
Basic types (str, int, bool)	Direct JSON serialization
Binary data (bytes, bytearray, memoryview)	Stored in binary buffers with reference
NumPy/JAX arrays	Converted to binary buffers with dtype and shape metadata
Objects with `for_json` method	`object.for_json()` result is serialized
Datetime objects	Converted to JavaScript `Date`
Iterables (list, tuple)	Recursively serialized
Callable objects	Converted to JavaScript callback functions (widget mode only)

Binary data is handled efficiently by storing the raw bytes in separate buffers rather than base64 encoding in JSON. This is particularly important for large numeric arrays and binary data.

There is a 100mb limit on the size of initial data and subsequent messages (per message).

The serialization process also handles state management for interactive widgets, collecting initial state and synced keys to enable bidirectional updates between Python and JavaScript. For more details on state management, see the State guide.

Widgets vs HTML¶

GenStudio offers two rendering modes:

HTML mode: Renders visualizations as standalone HTML, ideal for embedding in web pages or exporting. Plots persist across kernel restarts.
Widget mode: Renders visualizations as interactive Jupyter widgets. Enables bidirectional communication between Python and JavaScript.

You can choose the rendering mode in two ways:

Globally, using Plot.configure():

In [8]:

Copied!

Plot.configure(display_as="widget")  # Set global rendering mode to widget
Plot.configure(display_as="widget")  # Set global rendering mode to widget

Using a plot's .display_as(...) method:

In [9]:

Copied!





categorical_data = [
    {"category": "A", "value": 10},
    {"category": "B", "value": 20},
    {"category": "C", "value": 15},
    {"category": "D", "value": 25},
]
(
    Plot.dot(categorical_data, {"x": "value", "y": "category", "fill": "category"})
    + Plot.colorLegend()
).display_as("html")
categorical_data = [
    {"category": "A", "value": 10},
    {"category": "B", "value": 20},
    {"category": "C", "value": 15},
    {"category": "D", "value": 25},
]
(
    Plot.dot(categorical_data, {"x": "value", "y": "category", "fill": "category"})
    + Plot.colorLegend()
).display_as("html")

Out[9]:

The global setting affects all subsequent plots unless overridden by .display_as(). You can switch between modes as needed for different use cases.