google_api_dataflow v0.15.0 API Reference

Modules

API calls for all endpoints tagged Projects.

Handle Tesla connections for GoogleApi.Dataflow.V1b3.

Obsolete in favor of ApproximateReportedProgress and ApproximateSplitRequest.

A progress measurement of a WorkItem by a worker.

A suggestion by the service to the worker to dynamically split the WorkItem.

A structured message reporting an autoscaling decision made by the Dataflow service.

Settings for WorkerPool autoscaling.

Metadata for a BigQuery connector used by the job.

Metadata for a BigTable connector used by the job.

Modeled after information exposed by /proc/stat.

Description of an interstitial value between transforms in an execution stage.

Description of a transform executed as part of an execution stage.

All configuration data for a particular Computation.

A position that encapsulates an inner position and an index for the inner position. A ConcatPosition can be used by a reader of a source that encapsulates a set of other sources.

CounterMetadata includes all static non-name non-value counter attributes.

Identifies a counter within a per-job namespace. Counters whose structured names are the same get merged into a single value for the job.

A single message which encapsulates structured name and metadata for a given counter.

An update to a Counter sent from a worker.

A request to create a Cloud Dataflow job from a template.

Identifies the location of a custom souce.

Data disk assignment for a given VM instance.

Metadata for a Datastore connector used by the job.

Specification of one of the bundles produced as a result of splitting a Source (e.g. when executing a SourceSplitRequest, or when splitting an active task using WorkItemStatus.dynamic_source_split), relative to the source being split.

Describes the data disk used by a workflow job.

Data provided with a pipeline or transform to provide descriptive info.

A metric value representing a distribution.

When a task splits using WorkItemStatus.dynamic_source_split, this message describes the two parts of the split relative to the description of the current task's input.

Describes the environment in which a Dataflow Job runs.

A message describing the state of a particular execution stage.

Description of the composing transforms, names/ids, and input/outputs of a stage of execution. Some composing transforms and sources may have been generated by the Dataflow service during execution planning.

Indicates which [regional endpoint] (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) failed to respond to a request for data.

Metadata for a File connector used by the job.

An instruction that copies its inputs (zero or more) to its (single) output.

A metric value representing a list of floating point numbers.

A representation of a floating point mean metric contribution.

Request to get updated debug configuration for component.

Response to a get debug configuration request.

The response to a GetTemplate request.

Histogram of value counts for a distribution.

Proto describing a hot key detected on a given WorkItem.

An input of an instruction, as a reference to an output of a producer instruction.

A metric value representing temporal values of a variable.

A metric value representing a list of integers.

A representation of an integer mean metric contribution.

Defines a job to be run by the Cloud Dataflow service.

Additional information about how a Cloud Dataflow job will be executed that isn't contained in the submitted job.

Contains information about how a particular google.dataflow.v1beta3.Step will be executed.

A particular message pertaining to a Dataflow job.

Metadata available primarily for filtering jobs. Will be included in the ListJob response and Job SUMMARY view.

JobMetrics contains a collection of metrics describing the detailed progress of a Dataflow job. Metrics correspond to user-defined and system-defined metrics in the job.

Data disk assignment information for a specific key-range of a sharded computation. Currently we only support UTF-8 character splits to simplify encoding into JSON.

Location information for a specific key-range of a sharded computation. Currently we only support UTF-8 character splits to simplify encoding into JSON.

Parameters to provide to the template being launched.

Response to the request to launch a template.

Response to a request to lease WorkItems.

Response to a request to list job messages.

Response to a request to list Cloud Dataflow jobs. This may be a partial response, depending on the page size in the ListJobsRequest.

MapTask consists of an ordered set of instructions, each of which describes one particular low-level operation for the worker to perform in order to accomplish the MapTask's WorkItem.

The metric short id is returned to the user alongside an offset into ReportWorkItemStatusRequest

Identifies a metric, by describing the source which generated the metric.

Describes the state of a metric.

Describes mounted data disk.

Information about an output of a multi-output DoFn.

Basic metadata about a counter.

The packages that must be installed in order for a worker to run the steps of the Cloud Dataflow job that will be assigned to its worker pool.

An instruction that does a ParDo operation. Takes one main input and zero or more side inputs, and produces zero or more outputs. Runs user code.

Describes a particular operation comprising a MapTask.

Structured data associated with this message.

Metadata for a specific parameter.

An instruction that does a partial group-by-key. One input and one output.

A descriptive representation of submitted pipeline as well as the executed form. This data is provided by the Dataflow service for ease of visualizing the pipeline and interpreting Dataflow provided metrics.

Position defines a position within a collection of data. The value can be either the end position, a key (used with ordered collections), a byte offset, or a record index.

Metadata for a PubSub connector used by the job.

Identifies a pubsub location to use for transferring data into or out of a streaming Dataflow job.

An instruction that reads records. Takes no inputs, produces one output.

Request to report the status of WorkItems.

Response from a request to report the status of WorkItems.

Represents the level of parallelism in a WorkItem's input, reported by the worker.

Worker metrics exported from workers. This contains resource utilization metrics accumulated from a variety of sources. For more information, see go/df-resource-signals.

Service-side response to WorkerMessage reporting resource utilization.

The environment values to set at runtime.

The version of the SDK used to run the job.

Request to send encoded debug information.

Response to a send capture request. nothing

A request for sending worker messages to the service.

The response to the worker messages.

Describes a particular function to invoke.

Information about an output of a SeqMapTask.

A task which consists of a shell command for the worker to execute.

Information about a side input of a DoFn or an input of a SeqDoFn.

A sink that records can be encoded and written to.

A source that records can be read and decoded from.

DEPRECATED in favor of DynamicSourceSplit.

A request to compute the SourceMetadata of a Source.

The result of a SourceGetMetadataOperation.

Metadata about a Source useful for automatically optimizing and tuning the pipeline, etc.

A work item that represents the different operations that can be performed on a user-defined Source specification.

The result of a SourceOperationRequest, specified in ReportWorkItemStatusRequest.source_operation when the work item is completed.

Hints for splitting a Source into bundles (parts for parallel processing) using SourceSplitRequest.

Represents the operation to split a high-level Source specification into bundles (parts for parallel processing).

The response to a SourceSplitRequest.

DEPRECATED in favor of DerivedSource.

Metadata for a Spanner connector used by the job.

A representation of an int64, n, that is immune to precision loss when encoded in JSON.

Description of an input or output of an execution stage.

The Status type defines a logical error model that is suitable for different programming environments, including REST APIs and RPC APIs. It is used by gRPC. Each Status message contains three pieces of data: error code, error message, and error details.

Defines a particular step within a Cloud Dataflow job.

Describes a stream of data, either as input to be processed or as output of a streaming Dataflow job.

Streaming appliance snapshot configuration.

Configuration information for a single streaming computation.

Describes full or partial data disk assignment information of the computation ranges.

A task which describes what action should be performed for the specified streaming computation ranges.

A task that carries configuration information for streaming computations.

A task which initializes part of a streaming Dataflow job.

Identifies the location of a streaming side input.

Identifies the location of a streaming computation stage, for stage-to-stage communication.

A metric value representing a list of strings.

A rich message format, including a human readable string, a key for identifying the message, and structured data associated with the message for programmatic consumption.

Taskrunner configuration settings.

Metadata describing a template.

Global topology of the streaming Dataflow job, including all computations and their sharded locations.

Description of the type, names/ids, and input/outputs for a transform.

Response to the validation request.

WorkItem represents basic information about a WorkItem to be executed in the cloud.

The Dataflow service's idea of the current state of a WorkItem being processed by a worker.

Conveys a worker's progress through the work described by a WorkItem.

WorkerHealthReport contains information about the health of a worker.

WorkerHealthReportResponse contains information returned to the worker in response to a health ping.

A report of an event in a worker's lifecycle. The proto contains one event, because the worker is expected to asynchronously send each message immediately after the event. Due to this asynchrony, messages may arrive out of order (or missing), and it is up to the consumer to interpret. The timestamp of the event is in the enclosing WorkerMessage proto.

WorkerMessage provides information to the backend about a worker.

A message code is used to report status and error messages to the service. The message codes are intended to be machine readable. The service will take care of translating these into user understandable messages if necessary.

A worker_message response allows the server to pass information to the sender.

Describes one particular pool of Cloud Dataflow workers to be instantiated by the Cloud Dataflow service in order to perform the computations required by a job. Note that a workflow job may use multiple pools, in order to match the various computational requirements of the various stages of the job.

Provides data to pass through to the worker harness.

Shutdown notification from workers. This is to be sent by the shutdown script of the worker VM so that the backend knows that the VM is being shut down.

Service-side response to WorkerMessage issuing shutdown notice.

An instruction that writes records. Takes one input, produces no outputs.