liken.datasets
liken.datasets.fake_10(backend='pandas', spark_session=None)
Synthetic 10 rows.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
backend
|
liken.types.SupportedBackends
|
One of "pandas", "polars" or "spark". |
'pandas'
|
spark_session
|
pyspark.sql.SparkSession | None
|
The pyspark spark session if requesting data using "spark" backend. |
None
|
Returns:
| Type | Description |
|---|---|
liken.types.UserDataFrame
|
A dataframe, in the defined backend. |
Raises:
| Type | Description |
|---|---|
ValueError
|
if no spark session passed when requesting a spark dataframe. |
Source code in src/liken/datasets.py
liken.datasets.fake_1K(backend='pandas', spark_session=None)
Synthetic 1K (one thousand) rows.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
backend
|
liken.types.SupportedBackends
|
One of "pandas", "polars" or "spark". |
'pandas'
|
spark_session
|
pyspark.sql.SparkSession | None
|
The pyspark spark session if requesting data using "spark" backend. |
None
|
Returns:
| Type | Description |
|---|---|
liken.types.UserDataFrame
|
A dataframe, in the defined backend. |
Raises:
| Type | Description |
|---|---|
ValueError
|
if no spark session passed when requesting a spark dataframe. |
Source code in src/liken/datasets.py
liken.datasets.fake_100K(backend='pandas', spark_session=None)
Synthetic 100K (one hundred thousand) rows.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
backend
|
liken.types.SupportedBackends
|
One of "pandas", "polars" or "spark". |
'pandas'
|
spark_session
|
pyspark.sql.SparkSession | None
|
The pyspark spark session if requesting data using "spark" backend. |
None
|
Returns:
| Type | Description |
|---|---|
liken.types.UserDataFrame
|
A dataframe, in the defined backend. |
Raises:
| Type | Description |
|---|---|
ValueError
|
if no spark session passed when requesting a spark dataframe. |
Source code in src/liken/datasets.py
liken.datasets.fake_1M(backend='pandas', spark_session=None)
Synthetic 1M (one million) rows.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
backend
|
liken.types.SupportedBackends
|
One of "pandas", "polars" or "spark". |
'pandas'
|
spark_session
|
pyspark.sql.SparkSession | None
|
The pyspark spark session if requesting data using "spark" backend. |
None
|
Returns:
| Type | Description |
|---|---|
liken.types.UserDataFrame
|
A dataframe, in the defined backend. |
Raises:
| Type | Description |
|---|---|
ValueError
|
if no spark session passed when requesting a spark dataframe. |