Transforming data with Relational Operators

  • Creating new relations with joins
  • Reducing data size by sampling
  • Extending Pig with user–defined functions

Filtering data with Pig

  • Consolidating data sets with unions
  • Partitioning data sets with splits
  • Injecting parameters into Pig scripts