pyspark.pandas.DataFrame.from_records#

static DataFrame.from_records(data, index=None, exclude=None, columns=None, coerce_float=False, nrows=None)[source]#

Convert structured or recorded ndarray to DataFrame.

Parameters
datandarray (structured dtype), list of tuples, dict, or DataFrame

Deprecated since version 4.0.0: Passing a DataFrame is deprecated.

indexstring, list of fields, array-like

Field of array to use as the index, alternately a specific set of input labels to use

excludesequence, default None

Columns or fields to exclude

columnssequence, default None

Column names to use. If the passed data do not have names associated with them, this argument provides names for the columns. Otherwise this argument indicates the order of the columns in the result (any names not found in the data will become all-NA columns)

coerce_floatboolean, default False

Attempt to convert values of non-string, non-numeric objects (like decimal.Decimal) to floating point, useful for SQL result sets

nrowsint, default None

Number of rows to read if data is an iterator

Returns
dfDataFrame

Examples

Use dict as input

>>> ps.DataFrame.from_records({'A': [1, 2, 3]})
   A
0  1
1  2
2  3

Use list of tuples as input

>>> ps.DataFrame.from_records([(1, 2), (3, 4)])
   0  1
0  1  2
1  3  4

Use NumPy array as input

>>> ps.DataFrame.from_records(np.eye(3))
     0    1    2
0  1.0  0.0  0.0
1  0.0  1.0  0.0
2  0.0  0.0  1.0