Plotly-Resampler 使用教程

2024-09-16 22:05:49作者：卓炯娓

1. 项目介绍

Plotly-Resampler 是一个用于可视化大规模时间序列数据的 Python 库。它通过动态聚合时间序列数据，使得在 Plotly 中可视化大量数据点（超过 100,000 个数据点）变得更加高效和响应迅速。该库的核心功能是通过时间序列数据点选择算法，动态聚合数据，并在用户交互（如平移或缩放）时更新图表。

主要特点

动态聚合：根据当前图表视图动态聚合数据，确保高效响应。
环境无关：支持 Jupyter、VSCode 笔记本、PyCharm 笔记本、Google Colab 等多种环境。
多种聚合算法：支持多种序列聚合算法，用户可以选择或开发自己的聚合方法。

2. 项目快速启动

安装

首先，使用 pip 安装 Plotly-Resampler：

pip install plotly-resampler

基本使用

以下是一个简单的示例，展示如何在 Jupyter 环境中使用 Plotly-Resampler 可视化大规模时间序列数据。

import plotly.graph_objects as go
import numpy as np
from plotly_resampler import FigureResampler

# 生成大量数据
x = np.arange(1_000_000)
noisy_sin = (3 + np.sin(x / 200) + np.random.randn(len(x)) / 10) * x / 1_000

# 创建 FigureResampler 对象
fig = FigureResampler(go.Figure())

# 添加数据
fig.add_trace(go.Scattergl(name='noisy sine', showlegend=True), hf_x=x, hf_y=noisy_sin)

# 显示图表
fig.show_dash(mode='inline')

3. 应用案例和最佳实践

案例1：金融市场数据可视化

在金融市场中，时间序列数据量通常非常庞大。使用 Plotly-Resampler 可以高效地可视化这些数据，同时保持交互性。

import pandas as pd
from plotly_resampler import FigureResampler

# 加载金融数据
data = pd.read_csv('financial_data.csv', parse_dates=['date'])

# 创建 FigureResampler 对象
fig = FigureResampler(go.Figure())

# 添加数据
fig.add_trace(go.Scattergl(x=data['date'], y=data['price'], name='Stock Price'), hf_x=data['date'], hf_y=data['price'])

# 显示图表
fig.show_dash(mode='inline')

案例2：物联网数据可视化

在物联网应用中，传感器数据通常以高频率生成。Plotly-Resampler 可以帮助用户在不影响性能的情况下可视化这些数据。

import numpy as np
from plotly_resampler import FigureResampler

# 生成模拟的物联网数据
timestamps = np.arange(0, 1000000, 1)
sensor_data = np.sin(timestamps / 1000) + np.random.randn(len(timestamps)) / 10

# 创建 FigureResampler 对象
fig = FigureResampler(go.Figure())

# 添加数据
fig.add_trace(go.Scattergl(x=timestamps, y=sensor_data, name='Sensor Data'), hf_x=timestamps, hf_y=sensor_data)

# 显示图表
fig.show_dash(mode='inline')