Dask: Difference between revisions

1 byte removed, 1 year ago
(Created page with "[https://docs.dask.org/en/stable/ Dask] is a flexible library for parallel computing in Python. It provides parallelized NumPy array and Pandas DataFrame objects, and it enables distributed computing in pure Python with access to the PyData stack. ==Installing our wheel== <!--T:15--> The preferred option is to install it using our provided Python [https://pythonwheels.com/ wheel] as follows: :1. Load a Python module, thus <...")
 
Line 162:
  index = pd.date_range("2021-09-01", periods=2400, freq="1H")
  df = pd.DataFrame({"a": np.arange(2400)}, index=index)
- ddf = dd.from_pandas(df, npartitions=10)
+ ddf = dd.from_pandas(df, npartitions=6)

  result = ddf.a.mean().compute()
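
The revision only changes the partition count from 10 to 6, which should leave the computed mean unchanged. The following is a minimal sketch of why, using plain NumPy as a stand-in rather than Dask itself: the helper `partitioned_mean` is hypothetical and is not Dask's implementation. It splits the same 2400 values into partitions, then combines per-partition sums and counts.

```python
import numpy as np


def partitioned_mean(values, npartitions):
    """Hypothetical helper: mean from per-partition sums and counts (not Dask's code)."""
    # Split the data into roughly equal chunks, as a partitioned frame would be.
    parts = np.array_split(values, npartitions)
    # Reduce each partition independently, then combine the partial results.
    total = sum(int(p.sum()) for p in parts)
    count = sum(p.size for p in parts)
    return total / count


values = np.arange(2400)  # same values as column "a" in the snippet
mean_10 = partitioned_mean(values, 10)  # partition count before the revision
mean_6 = partitioned_mean(values, 6)    # partition count after the revision
print(mean_10, mean_6)
```

Both calls give the same value because the sum and the count are each additive across partitions; only the amount of work per partition differs.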
cc_staff
282

edits
