Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
Loads an XML file and returns the result as a DataFrame. If schema is not specified, this function reads the input once to determine the input schema.
Syntax
xml(path, schema=None, **options)
Parameters
| Parameter | Type | Description |
|---|---|---|
path |
str, list, or RDD | One or more input paths, or an RDD of strings storing XML rows. |
schema |
StructType or str, optional | An optional input schema as a StructType object or a DDL-formatted string (for example, 'col0 INT, col1 DOUBLE'). |
Returns
DataFrame
Examples
Write a DataFrame into an XML file and read it back.
import tempfile
with tempfile.TemporaryDirectory(prefix="xml") as d:
spark.createDataFrame(
[{"age": 100, "name": "Alice"}]
).write.mode("overwrite").option("rowTag", "person").format("xml").save(d)
spark.read.option("rowTag", "person").xml(d).show()
# +---+------------+
# |age| name|
# +---+------------+
# |100|Alice|
# +---+------------+