org.apache.spark.sql.execution

Distinct

case class Distinct(partial: Boolean, child: SparkPlan) extends SparkPlan with UnaryNode with Product with Serializable

:: DeveloperApi :: Computes the set of distinct input rows using a HashSet.

partial

when true the distinct operation is performed partially, per partition, without shuffling the data.

child

the input query plan.

Annotations
@DeveloperApi()
Linear Supertypes
Product, Equals, UnaryNode, catalyst.trees.UnaryNode[SparkPlan], SparkPlan, Serializable, Serializable, Logging, QueryPlan[SparkPlan], TreeNode[SparkPlan], AnyRef, Any
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. Distinct
  2. Product
  3. Equals
  4. UnaryNode
  5. UnaryNode
  6. SparkPlan
  7. Serializable
  8. Serializable
  9. Logging
  10. QueryPlan
  11. TreeNode
  12. AnyRef
  13. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new Distinct(partial: Boolean, child: SparkPlan)

    partial

    when true the distinct operation is performed partially, per partition, without shuffling the data.

    child

    the input query plan.

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. def apply(number: Int): SparkPlan

    Definition Classes
    TreeNode
  7. def argString: String

    Definition Classes
    TreeNode
  8. def asCode: String

    Definition Classes
    TreeNode
  9. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  10. val child: SparkPlan

    the input query plan.

    the input query plan.

    Definition Classes
    Distinct → UnaryNode
  11. def children: List[SparkPlan]

    Definition Classes
    UnaryNode
  12. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  13. val codegenEnabled: Boolean

    Definition Classes
    SparkPlan
  14. def collect[B](pf: PartialFunction[SparkPlan, B]): Seq[B]

    Definition Classes
    TreeNode
  15. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  16. def execute(): RDD[catalyst.expressions.Row]

    Runs this query returning the result as an RDD.

    Runs this query returning the result as an RDD.

    Definition Classes
    DistinctSparkPlan
  17. def executeCollect(): Array[catalyst.expressions.Row]

    Runs this query returning the result as an array.

    Runs this query returning the result as an array.

    Definition Classes
    SparkPlan
  18. def expressions: Seq[Expression]

    Definition Classes
    QueryPlan
  19. def fastEquals(other: TreeNode[_]): Boolean

    Definition Classes
    TreeNode
  20. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  21. def flatMap[A](f: (SparkPlan) ⇒ TraversableOnce[A]): Seq[A]

    Definition Classes
    TreeNode
  22. def foreach(f: (SparkPlan) ⇒ Unit): Unit

    Definition Classes
    TreeNode
  23. def generateTreeString(depth: Int, builder: StringBuilder): StringBuilder

    Attributes
    protected
    Definition Classes
    TreeNode
  24. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  25. def getNodeNumbered(number: MutableInt): SparkPlan

    Attributes
    protected
    Definition Classes
    TreeNode
  26. val id: Long

    Definition Classes
    TreeNode
  27. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  28. def isTraceEnabled(): Boolean

    Attributes
    protected
    Definition Classes
    Logging
  29. def log: Logger

    Attributes
    protected
    Definition Classes
    Logging
  30. def logDebug(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  31. def logDebug(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  32. def logError(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  33. def logError(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  34. def logInfo(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  35. def logInfo(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  36. def logName: String

    Attributes
    protected
    Definition Classes
    Logging
  37. def logTrace(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  38. def logTrace(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  39. def logWarning(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  40. def logWarning(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  41. def makeCopy(newArgs: Array[AnyRef]): Distinct.this.type

    Overridden make copy also propogates sqlContext to copied plan.

    Overridden make copy also propogates sqlContext to copied plan.

    Definition Classes
    SparkPlan → TreeNode
  42. def map[A](f: (SparkPlan) ⇒ A): Seq[A]

    Definition Classes
    TreeNode
  43. def mapChildren(f: (SparkPlan) ⇒ SparkPlan): Distinct.this.type

    Definition Classes
    TreeNode
  44. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  45. def newMutableProjection(expressions: Seq[Expression], inputSchema: Seq[Attribute]): () ⇒ MutableProjection

    Attributes
    protected
    Definition Classes
    SparkPlan
  46. def newOrdering(order: Seq[SortOrder], inputSchema: Seq[Attribute]): Ordering[catalyst.expressions.Row]

    Attributes
    protected
    Definition Classes
    SparkPlan
  47. def newPredicate(expression: Expression, inputSchema: Seq[Attribute]): (catalyst.expressions.Row) ⇒ Boolean

    Attributes
    protected
    Definition Classes
    SparkPlan
  48. def newProjection(expressions: Seq[Expression], inputSchema: Seq[Attribute]): Projection

    Attributes
    protected
    Definition Classes
    SparkPlan
  49. def nodeName: String

    Definition Classes
    TreeNode
  50. final def notify(): Unit

    Definition Classes
    AnyRef
  51. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  52. def numberedTreeString: String

    Definition Classes
    TreeNode
  53. def otherCopyArgs: Seq[AnyRef]

    Attributes
    protected
    Definition Classes
    TreeNode
  54. def output: Seq[Attribute]

    Definition Classes
    Distinct → QueryPlan
  55. def outputPartitioning: Partitioning

    Specifies how data is partitioned across different nodes in the cluster.

    Specifies how data is partitioned across different nodes in the cluster.

    Definition Classes
    UnaryNode → SparkPlan
  56. def outputSet: AttributeSet

    Definition Classes
    QueryPlan
  57. val partial: Boolean

    when true the distinct operation is performed partially, per partition, without shuffling the data.

  58. def printSchema(): Unit

    Definition Classes
    QueryPlan
  59. def requiredChildDistribution: Seq[Distribution]

    Specifies any partition requirements on the input data for this operator.

    Specifies any partition requirements on the input data for this operator.

    Definition Classes
    DistinctSparkPlan
  60. def sameInstance(other: TreeNode[_]): Boolean

    Definition Classes
    TreeNode
  61. def schema: catalyst.types.StructType

    Definition Classes
    QueryPlan
  62. def schemaString: String

    Definition Classes
    QueryPlan
  63. def simpleString: String

    Definition Classes
    TreeNode
  64. def sparkContext: SparkContext

    Attributes
    protected
    Definition Classes
    SparkPlan
  65. val sqlContext: SQLContext

    A handle to the SQL Context that was used to create this plan.

    A handle to the SQL Context that was used to create this plan. Since many operators need access to the sqlContext for RDD operations or configuration this field is automatically populated by the query planning infrastructure.

    Attributes
    protected[org.apache.spark]
    Definition Classes
    SparkPlan
  66. def stringArgs: Iterator[Any]

    Attributes
    protected
    Definition Classes
    TreeNode
  67. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  68. def toString(): String

    Definition Classes
    TreeNode → AnyRef → Any
  69. def transform(rule: PartialFunction[SparkPlan, SparkPlan]): SparkPlan

    Definition Classes
    TreeNode
  70. def transformAllExpressions(rule: PartialFunction[Expression, Expression]): Distinct.this.type

    Definition Classes
    QueryPlan
  71. def transformChildrenDown(rule: PartialFunction[SparkPlan, SparkPlan]): Distinct.this.type

    Definition Classes
    TreeNode
  72. def transformChildrenUp(rule: PartialFunction[SparkPlan, SparkPlan]): Distinct.this.type

    Definition Classes
    TreeNode
  73. def transformDown(rule: PartialFunction[SparkPlan, SparkPlan]): SparkPlan

    Definition Classes
    TreeNode
  74. def transformExpressions(rule: PartialFunction[Expression, Expression]): Distinct.this.type

    Definition Classes
    QueryPlan
  75. def transformExpressionsDown(rule: PartialFunction[Expression, Expression]): Distinct.this.type

    Definition Classes
    QueryPlan
  76. def transformExpressionsUp(rule: PartialFunction[Expression, Expression]): Distinct.this.type

    Definition Classes
    QueryPlan
  77. def transformUp(rule: PartialFunction[SparkPlan, SparkPlan]): SparkPlan

    Definition Classes
    TreeNode
  78. def treeString: String

    Definition Classes
    TreeNode
  79. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  80. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  81. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  82. def withNewChildren(newChildren: Seq[SparkPlan]): Distinct.this.type

    Definition Classes
    TreeNode

Inherited from Product

Inherited from Equals

Inherited from UnaryNode

Inherited from catalyst.trees.UnaryNode[SparkPlan]

Inherited from SparkPlan

Inherited from Serializable

Inherited from Serializable

Inherited from Logging

Inherited from QueryPlan[SparkPlan]

Inherited from TreeNode[SparkPlan]

Inherited from AnyRef

Inherited from Any

Ungrouped