sparkSpark’s primary abstraction is a distributed collection of items called a Dataset. Datasets can be created from Hadoop InputFormats (such as HDFS files) or by transforming other Datasets.Spark collaboration with tools for teams. Spark is the perfect tool for businesses, allowing you to compose, delegate and manage emails directly with your colleagues - use inbox collaboration