data:image/s3,"s3://crabby-images/e3a4b/e3a4b9b37bec606c99cae253403ab1ffb4b719f1" alt="Python:Data Analytics and Visualization"
Time series plotting
Pandas comes with great support for plotting, and this holds true for time series data as well.
As a first example, let's take some monthly data and plot it:
>>> rng = pd.date_range(start='2000', periods=120, freq='MS') >>> ts = pd.Series(np.random.randint(-10, 10, size=len(rng)), rng).cumsum() >>> ts.head() 2000-01-01 -4 2000-02-01 -6 2000-03-01 -16 2000-04-01 -26 2000-05-01 -24 Freq: MS, dtype: int64
Since matplotlib is used under the hood, we can pass a familiar parameter to plot, such as c for color, or title for the chart title:
>>> ts.plot(c='k', title='Example time series') >>> plt.show()
The following figure shows an example time series plot:
data:image/s3,"s3://crabby-images/ae3f4/ae3f49736e04ccca2a3159b7ce1d9af00d85a13f" alt=""
We can overlay an aggregate plot over 2 and 5 years:
>>> ts.resample('2A').plot(c='0.75', ls='--') >>> ts.resample('5A').plot(c='0.25', ls='-.')
The following figure shows the resampled 2-year plot:
data:image/s3,"s3://crabby-images/2e244/2e2446890987e15f9111e12e2ae3ce439c8e7fca" alt=""
The following figure shows the resample 5-year plot:
data:image/s3,"s3://crabby-images/2d059/2d059aaa288a07cc0c55bae63d62ecd19e8cdfaf" alt=""
We can pass the kind of chart to the plot
method as well. The return value of the plot
method is an AxesSubplot
, which allows us to customize many aspects of the plot. Here we are setting the label values on the X
axis to the year values from our time series:
>>> plt.clf() >>> tsx = ts.resample('1A') >>> ax = tsx.plot(kind='bar', color='k') >>> ax.set_xticklabels(tsx.index.year)
data:image/s3,"s3://crabby-images/7af8c/7af8cd15040f142f3cef77c5f5641009de559e08" alt=""
Let's imagine we have four time series that we would like to plot simultaneously. We generate a matrix of 1000 × 4 random values and treat each column as a separated time series:
>>> plt.clf() >>> ts = pd.Series(np.random.randn(1000), index=pd.date_range('1/1/2000', periods=1000)) >>> df = pd.DataFrame(np.random.randn(1000, 4), index=ts.index, columns=['A', 'B', 'C', 'D']) >>> df = df.cumsum()>>> df.plot(color=['k', '0.75', '0.5', '0.25'], ls='--')
data:image/s3,"s3://crabby-images/cc304/cc304b331e40d05b974649b8e0e488b82fa9ae51" alt=""