cc_staff
282
edits
(Created page with "[https://docs.dask.org/en/stable/ Dask] is a flexible library for parallel computing in Python. It provides parallelized NumPy array and Pandas DataFrame objects, and it enables distributed computing in pure Python with access to the PyData stack. ==Installing our wheel== <!--T:15--> The preferred option is to install it using our provided Python [https://pythonwheels.com/ wheel] as follows: :1. Load a Python module, thus <...") |
|||
Line 162: | Line 162: | ||
index = pd.date_range("2021-09-01", periods=2400, freq="1H") | index = pd.date_range("2021-09-01", periods=2400, freq="1H") | ||
df = pd.DataFrame({"a": np.arange(2400)}, index=index) | df = pd.DataFrame({"a": np.arange(2400)}, index=index) | ||
ddf = dd.from_pandas(df, npartitions= | ddf = dd.from_pandas(df, npartitions=6) | ||
result = ddf.a.mean().compute() | result = ddf.a.mean().compute() |