Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
Returns the schema of the data source.
It can refer to any field initialized in the __init__ method to infer the data source's schema when users do not explicitly specify it. This method is invoked once when calling spark.read.format(...).load() to get the schema for a data source read operation. If this method is not implemented, and a user does not provide a schema when reading the data source, an exception will be thrown.
Syntax
schema()
Returns
StructType or str
The schema of this data source or a DDL string representing the schema.
Examples
Returns a DDL string:
def schema(self):
return "a INT, b STRING"
Returns a StructType:
def schema(self):
return StructType().add("a", "int").add("b", "string")