BloomfilterIndex: details of performance gain can be found in article writtern here https://medium.com/@22.gautam/bloom-filter-index-in-apache-spark-boosting-query-performance-with-probabilistic-magic-5724545edcbc
Bucketing: details of buketting can be found at https://www.linkedin.com/pulse/difference-between-partitioning-bucketing-spark-vivek-raj-jj4cc/