pyspark.pandas.DataFrame.from_dict

static DataFrame.from_dict(data, orient='columns', dtype=None, columns=None)

Construct DataFrame from dict of array-like or dicts.

Creates a DataFrame from a dictionary, interpreting its keys as either columns or index labels, with optional dtype specification.

Parameters

data : dict
    Of the form {field : array-like} or {field : dict}.
orient : {‘columns’, ‘index’}, default ‘columns’
    The “orientation” of the data. If the keys of the passed dict should be the columns of the resulting DataFrame, pass ‘columns’ (default). Otherwise, if the keys should be rows, pass ‘index’.
dtype : dtype, default None
    Data type to force, otherwise infer.
columns : list, default None
    Column labels to use when orient='index'. Raises a ValueError if used with orient='columns'.

Returns

DataFrame

See also

DataFrame.from_records: DataFrame from structured ndarray, sequence of tuples or dicts, or DataFrame.
DataFrame: DataFrame object creation using constructor.

Examples

By default the keys of the dict become the DataFrame columns:

>>> import pyspark.pandas as ps
>>> data = {'col_1': [3, 2, 1, 0], 'col_2': [10, 20, 30, 40]}
>>> ps.DataFrame.from_dict(data)
   col_1  col_2
0      3     10
1      2     20
2      1     30
3      0     40

Specify orient='index' to create the DataFrame using dictionary keys as rows:

>>> data = {'row_1': [3, 2, 1, 0], 'row_2': [10, 20, 30, 40]}
>>> ps.DataFrame.from_dict(data, orient='index').sort_index()
        0   1   2   3
row_1   3   2   1   0
row_2  10  20  30  40

When using the ‘index’ orientation, the column names can be specified manually:

>>> ps.DataFrame.from_dict(data, orient='index',
...                        columns=['A', 'B', 'C', 'D']).sort_index()
        A   B   C   D
row_1   3   2   1   0
row_2  10  20  30  40
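
The orientation rules described under Parameters can also be illustrated without Spark. The following is a minimal pure-Python sketch, not the library's implementation: the helper name `sketch_from_dict` and its return shape (index labels, column labels, rows) are hypothetical and chosen only for illustration, the input is assumed to be a dict of equal-length lists, and the error message texts are assumptions as well.

```python
from typing import Any, Dict, List, Optional, Sequence, Tuple


def sketch_from_dict(
    data: Dict[Any, Sequence[Any]],
    orient: str = "columns",
    columns: Optional[List[Any]] = None,
) -> Tuple[List[Any], List[Any], List[List[Any]]]:
    """Hypothetical helper: return (index labels, column labels, rows)."""
    if orient == "columns":
        # `columns` is only meaningful with orient='index'.
        if columns is not None:
            raise ValueError("cannot use columns parameter with orient='columns'")
        # Dict keys become column labels; rows get a default 0..n-1 index.
        col_labels = list(data.keys())
        n_rows = len(next(iter(data.values()), []))
        index = list(range(n_rows))
        rows = [[data[c][i] for c in col_labels] for i in index]
    elif orient == "index":
        # Dict keys become row labels; each value is one row.
        index = list(data.keys())
        rows = [list(data[k]) for k in index]
        width = len(rows[0]) if rows else 0
        # Columns default to positions 0..width-1 unless given explicitly.
        col_labels = list(columns) if columns is not None else list(range(width))
    else:
        raise ValueError("orient must be 'columns' or 'index'")
    return index, col_labels, rows


data = {'row_1': [3, 2, 1, 0], 'row_2': [10, 20, 30, 40]}
index, col_labels, rows = sketch_from_dict(
    data, orient='index', columns=['A', 'B', 'C', 'D']
)
```

Here `index` holds the dict keys, `col_labels` holds the supplied column names, and `rows` holds one list per key, which matches the layout of the last example above.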