The Big Book of Data Science Use Cases


In this article, we showed how Apache Spark and Databricks can be used for time series analysis both directly, through windowing and wrappers, and indirectly, through Koalas.

With Spark and Koalas for time series analysis, you can:

  • Parallelize analyses of your time series for risk, fraud or compliance use cases with as-of joins and simple aggregations
  • Iterate faster and create rich time series features with Databricks Connect
  • Arm your data science and quant teams with Koalas to scale out data preparation without sacrificing the ease of use and APIs of pandas
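
As a reminder of what an as-of join computes, here is a minimal sketch in plain Python. It illustrates the semantics only and is not the Spark implementation discussed in this article: the function name `asof_join`, the tuple layouts and the sample trades and quotes are all hypothetical. For each trade, it attaches the most recent quote for the same symbol at or before the trade's timestamp.

```python
from bisect import bisect_right


def asof_join(trades, quotes):
    """Attach to each trade the latest quote at or before the trade time.

    trades: list of (timestamp, symbol, price) tuples (hypothetical layout)
    quotes: list of (timestamp, symbol, bid) tuples (hypothetical layout)
    """
    # Group quotes by symbol, sorted by timestamp, so each lookup
    # can use a binary search instead of a full scan.
    times_by_symbol = {}
    bids_by_symbol = {}
    for ts, symbol, bid in sorted(quotes):
        times_by_symbol.setdefault(symbol, []).append(ts)
        bids_by_symbol.setdefault(symbol, []).append(bid)

    joined = []
    for ts, symbol, price in trades:
        times = times_by_symbol.get(symbol, [])
        # Index of the last quote whose timestamp is <= the trade timestamp.
        idx = bisect_right(times, ts) - 1
        bid = bids_by_symbol[symbol][idx] if idx >= 0 else None
        joined.append((ts, symbol, price, bid))
    return joined


# Made-up sample data, for illustration only.
trades = [(5, "AAPL", 100.0), (12, "AAPL", 101.0), (3, "MSFT", 50.0)]
quotes = [(1, "AAPL", 99.5), (10, "AAPL", 100.5), (4, "MSFT", 49.5)]
result = asof_join(trades, quotes)
```

Because each symbol is processed independently of the others, this kind of per-key lookup is the sort of work that can be spread across a cluster, which is where the parallelism mentioned above comes from. A trade with no earlier quote, such as the MSFT trade here, gets `None` rather than a later quote.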

