Specifies the underlying output data source.
Syntax
format(source)
Parameters
| Parameter | Type | Description |
|---|---|---|
| source | str | Name of the data source, for example 'parquet' or 'console'. |
Returns
DataStreamWriter
Examples
df = spark.readStream.format("rate").load()
df.writeStream.format("text")
# <...streaming.readwriter.DataStreamWriter object ...>
Write a Rate source stream to CSV:
import tempfile
import time
# d receives the CSV output; cp holds the streaming checkpoint.
with tempfile.TemporaryDirectory(prefix="format1") as d:
    with tempfile.TemporaryDirectory(prefix="format2") as cp:
        df = spark.readStream.format("rate").load()
        q = df.writeStream.format("csv").option("checkpointLocation", cp).start(d)
        time.sleep(5)  # let the query write a few batches
        q.stop()
        # Read the written files back as a batch DataFrame.
        spark.read.schema("timestamp TIMESTAMP, value STRING").csv(d).show()
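
Because `format` returns a `DataStreamWriter`, it can be chained with `option` calls, as in the CSV example above. The following is a minimal sketch of that chaining pattern, not part of PySpark: `configure_sink` is a hypothetical helper, and `RecordingWriter` is a stand-in class (an assumption for illustration) so the snippet runs without a Spark session.

```python
from typing import Any, Mapping


def configure_sink(writer: Any, source: str, options: Mapping[str, str]) -> Any:
    """Apply format(source) and each option to a DataStreamWriter-like object.

    Hypothetical helper for illustration; it is not a PySpark API.
    """
    writer = writer.format(source)
    for key, value in options.items():
        writer = writer.option(key, value)
    return writer


class RecordingWriter:
    """Stand-in for DataStreamWriter that only records what it was given."""

    def __init__(self) -> None:
        self.source = None
        self.options = {}

    def format(self, source: str) -> "RecordingWriter":
        self.source = source
        return self  # returning the writer is what makes chaining possible

    def option(self, key: str, value: str) -> "RecordingWriter":
        self.options[key] = value
        return self


writer = configure_sink(RecordingWriter(), "csv", {"checkpointLocation": "/tmp/cp"})
print(writer.source, writer.options)
```

With a real streaming DataFrame, the same helper could presumably be called as `configure_sink(df.writeStream, "csv", {"checkpointLocation": cp}).start(d)`, since `df.writeStream` exposes the same `format` and `option` methods.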