Lakehouse
Warning
The functions are not fully tested yet. Use with caution. Please report any issues to the GitHub repository.
lakehouse_create
lakehouse_create(workspace_id: str, display_name: str, description: str = None, enable_schemas: bool = None, await_lro: bool = None, timeout: int = 60 * 5, preview: bool = True) -> requests.Response
Create a lakehouse.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`workspace_id` | `str` | The id of the workspace to create the lakehouse in. | required |
`display_name` | `str` | The display name of the lakehouse. | required |
`description` | `str \| None` | The description of the lakehouse. | `None` |
`enable_schemas` | `bool \| None` | Whether the lakehouse is schema enabled. | `None` |
`await_lro` | `bool \| None` | Whether to await the long running operation. | `None` |
`timeout` | `int` | Timeout for the long running operation (seconds). Defaults to 5 minutes. | `60 * 5` |
`preview` | `bool` | Whether to preview the request. You will be asked to confirm the request before it is executed. Defaults to True. | `True` |
Returns:
Type | Description |
---|---|
`Response` | The response from the request. |
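The `await_lro` and `timeout` parameters suggest a poll-until-done loop with a deadline. The sketch below illustrates that general pattern only, not this library's implementation: `poll_until_done`, the `get_status` callable, the polling interval, and the status strings are all hypothetical names chosen for the example.

```python
import time
from typing import Callable, Tuple


def poll_until_done(
    get_status: Callable[[], str],
    timeout: float = 60 * 5,
    interval: float = 1.0,
    terminal_states: Tuple[str, ...] = ("Succeeded", "Failed"),  # assumed status names
) -> str:
    """Call get_status() until it reports a terminal state or the timeout elapses."""
    deadline = time.monotonic() + timeout
    while True:
        status = get_status()
        if status in terminal_states:
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError(f"Operation still '{status}' after {timeout} seconds")
        time.sleep(interval)


# Usage with a fake operation that finishes on the third poll.
_states = iter(["Running", "Running", "Succeeded"])
result = poll_until_done(lambda: next(_states), timeout=5, interval=0.01)
print(result)
```

A monotonic clock is used for the deadline so that system clock adjustments do not shorten or extend the wait.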
lakehouse_get
lakehouse_get(workspace_id: str, lakehouse_id: str, preview: bool = True) -> requests.Response
Get a lakehouse.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`workspace_id` | `str` | The id of the workspace to get the lakehouse from. | required |
`lakehouse_id` | `str` | The id of the lakehouse to get. | required |
`preview` | `bool` | Whether to preview the request. You will be asked to confirm the request before it is executed. Defaults to True. | `True` |
Returns:
Type | Description |
---|---|
`Response` | The response from the request. |
lakehouse_list
lakehouse_list(workspace_id: str, continuation_token: str = None, preview: bool = True) -> requests.Response
List lakehouses for a workspace.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`workspace_id` | `str` | The id of the workspace to list lakehouses for. | required |
`continuation_token` | `str \| None` | A token for retrieving the next page of results. | `None` |
`preview` | `bool` | Whether to preview the request. You will be asked to confirm the request before it is executed. Defaults to True. | `True` |
Returns:
Type | Description |
---|---|
`Response` | The response from the request. |
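Because `continuation_token` retrieves the next page, listing everything means calling the function repeatedly until no token comes back. The following is a minimal sketch of that loop, assuming the parsed response body holds the items under `value` and the next token under `continuationToken`; both key names are assumptions to verify against the actual response. `iter_items` and `fetch_page` are hypothetical helpers, and a fake two-page backend stands in for the real call.

```python
from typing import Callable, Iterator, Optional


def iter_items(
    fetch_page: Callable[[Optional[str]], dict],
    items_key: str = "value",  # assumed key holding the page's items
) -> Iterator[dict]:
    """Yield items from every page, following continuation tokens until none is returned."""
    token: Optional[str] = None
    while True:
        body = fetch_page(token)
        yield from body.get(items_key, [])
        token = body.get("continuationToken")  # assumed key holding the next token
        if not token:
            break


# Fake two-page backend standing in for the real request.
_pages = {
    None: {"value": [{"displayName": "lh_a"}], "continuationToken": "t1"},
    "t1": {"value": [{"displayName": "lh_b"}]},
}
names = [item["displayName"] for item in iter_items(lambda token: _pages[token])]
print(names)
```

With the documented signature, `fetch_page` could plausibly be `lambda token: lakehouse_list(workspace_id, continuation_token=token, preview=False).json()`, since the function returns a `requests.Response`.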
lakehouse_update
lakehouse_update(workspace_id: str, lakehouse_id: str, display_name: str = None, description: str = None, preview: bool = True) -> requests.Response
Update a lakehouse.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`workspace_id` | `str` | The id of the workspace that contains the lakehouse. | required |
`lakehouse_id` | `str` | The id of the lakehouse to update. | required |
`display_name` | `str \| None` | The new display name of the lakehouse. | `None` |
`description` | `str \| None` | The new description of the lakehouse. | `None` |
`preview` | `bool` | Whether to preview the request. You will be asked to confirm the request before it is executed. Defaults to True. | `True` |
Returns:
Type | Description |
---|---|
`Response` | The response from the request. |
lakehouse_delete
lakehouse_delete(workspace_id: str, lakehouse_id: str, preview: bool = True) -> requests.Response
Delete a lakehouse.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`workspace_id` | `str` | The id of the workspace that contains the lakehouse. | required |
`lakehouse_id` | `str` | The id of the lakehouse to delete. | required |
`preview` | `bool` | Whether to preview the request. You will be asked to confirm the request before it is executed. Defaults to True. | `True` |
Returns:
Type | Description |
---|---|
`Response` | The response from the request. |
lakehouse_run_background_job
lakehouse_run_background_job(workspace_id: str, lakehouse_id: str, job_type: str, table_name: str, schema_name: str = None, v_order: bool = None, z_order_columns: List[str] = None, retention_period: str = None, await_lro: bool = None, timeout: int = 60 * 5, preview: bool = True) -> requests.Response
Run an on-demand table maintenance job instance.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`workspace_id` | `str` | The id of the workspace to create a job for. | required |
`lakehouse_id` | `str` | The id of the lakehouse to create a job for. | required |
`job_type` | `str` | The type of the job to create. Must be "TableMaintenance". | required |
`table_name` | `str` | The name of the table to run the job on. | required |
`schema_name` | `str \| None` | The name of the schema to run the job on. Only applicable for schema enabled lakehouses. | `None` |
`v_order` | `bool \| None` | Whether the table should be v-ordered. | `None` |
`z_order_columns` | `List[str] \| None` | List of columns to z-order by. | `None` |
`retention_period` | `str \| None` | Retention period in the format d:hh:mm:ss. Overrides the default retention period. | `None` |
`await_lro` | `bool \| None` | Whether to await the long running operation. | `None` |
`timeout` | `int` | Timeout for the long running operation (seconds). Defaults to 5 minutes. | `60 * 5` |
`preview` | `bool` | Whether to preview the request. You will be asked to confirm the request before it is executed. Defaults to True. | `True` |
Returns:
Type | Description |
---|---|
`Response` | The response from the request. |
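The `retention_period` argument expects a string in the d:hh:mm:ss format. As a convenience, such a string can be built from a `datetime.timedelta`; the helper below is a small sketch of that conversion, and `format_retention_period` is a hypothetical name that is not part of this library.

```python
from datetime import timedelta


def format_retention_period(period: timedelta) -> str:
    """Format a non-negative timedelta as d:hh:mm:ss (fractional seconds are dropped)."""
    total_seconds = int(period.total_seconds())
    if total_seconds < 0:
        raise ValueError("retention period must not be negative")
    days, remainder = divmod(total_seconds, 86400)
    hours, remainder = divmod(remainder, 3600)
    minutes, seconds = divmod(remainder, 60)
    return f"{days}:{hours:02d}:{minutes:02d}:{seconds:02d}"


retention = format_retention_period(timedelta(days=7, hours=1))
print(retention)  # 7:01:00:00
```

The resulting string can then be passed as `retention_period` when running a table maintenance job.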
lakehouse_list_tables
lakehouse_list_tables(workspace_id: str, lakehouse_id: str, continuation_token: str = None, max_results: int = None, preview: bool = True) -> requests.Response
List tables in a lakehouse.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`workspace_id` | `str` | The id of the workspace to list tables for. | required |
`lakehouse_id` | `str` | The id of the lakehouse to list tables for. | required |
`continuation_token` | `str \| None` | A token for retrieving the next page of results. | `None` |
`max_results` | `int \| None` | The maximum number of results to return. | `None` |
`preview` | `bool` | Whether to preview the request. You will be asked to confirm the request before it is executed. Defaults to True. | `True` |
Returns:
Type | Description |
---|---|
`Response` | The response from the request. |
lakehouse_load_table
lakehouse_load_table(workspace_id: str, lakehouse_id: str, table_name: str, relative_path: str, path_type: str, format: str = None, header: bool = None, delimiter: str = None, mode: str = None, file_extension: str = None, recursive: bool = None, await_lro: bool = None, timeout: int = 60 * 5, preview: bool = True) -> requests.Response
Load a table.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`workspace_id` | `str` | The id of the workspace to load the table for. | required |
`lakehouse_id` | `str` | The id of the lakehouse to load the table for. | required |
`table_name` | `str` | The name of the table to load. | required |
`relative_path` | `str` | The relative path to the file or folder to load. | required |
`path_type` | `str` | The type of the path to load. Either "File" or "Folder". | required |
`format` | `str \| None` | The format of the files to load. Must be "Parquet" or "Csv". | `None` |
`header` | `bool \| None` | Whether the file has a header row. Only applicable for csv files. | `None` |
`delimiter` | `str \| None` | The delimiter of the csv files. Only applicable for csv files. | `None` |
`mode` | `str \| None` | The mode to load the table in. Either "Overwrite" or "Append". | `None` |
`file_extension` | `str \| None` | The file extension of the files to load. | `None` |
`recursive` | `bool \| None` | Whether to search data files recursively when loading from a folder. | `None` |
`await_lro` | `bool \| None` | Whether to await the long running operation. | `None` |
`timeout` | `int` | Timeout for the long running operation (seconds). Defaults to 5 minutes. | `60 * 5` |
`preview` | `bool` | Whether to preview the request. You will be asked to confirm the request before it is executed. Defaults to True. | `True` |
Returns:
Type | Description |
---|---|
`Response` | The response from the request. |
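Several `lakehouse_load_table` arguments accept only a fixed set of strings, and `header` and `delimiter` apply only to csv files. The sketch below checks those documented constraints before a request is made. `check_load_options` is a hypothetical helper, not part of this library, and rejecting csv-only options for Parquet is a design choice made for this example rather than documented behaviour.

```python
from typing import Any, Dict, Optional

# Allowed values as listed in the parameter table above.
ALLOWED_VALUES = {
    "path_type": {"File", "Folder"},
    "format": {"Parquet", "Csv"},
    "mode": {"Overwrite", "Append"},
}


def check_load_options(
    path_type: str,
    format: Optional[str] = None,
    mode: Optional[str] = None,
    header: Optional[bool] = None,
    delimiter: Optional[str] = None,
) -> Dict[str, Any]:
    """Validate the options and return only those that were set, as keyword arguments."""
    options = {
        "path_type": path_type,
        "format": format,
        "mode": mode,
        "header": header,
        "delimiter": delimiter,
    }
    for name, allowed in ALLOWED_VALUES.items():
        value = options[name]
        if value is not None and value not in allowed:
            raise ValueError(f"{name} must be one of {sorted(allowed)}, got {value!r}")
    # Assumption for this sketch: csv-only options are rejected for Parquet.
    if format == "Parquet" and (header is not None or delimiter is not None):
        raise ValueError("header and delimiter only apply to Csv files")
    return {name: value for name, value in options.items() if value is not None}


kwargs = check_load_options("File", format="Csv", mode="Overwrite", header=True, delimiter=",")
print(kwargs)
```

The returned dictionary can be unpacked into the call alongside the workspace id, lakehouse id, table name, and relative path.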