UNI-MB - logo
UMNIK - logo
 
E-resources
Peer reviewed Open access
  • Time Series FeatuRe Extract...
    Christ, Maximilian; Braun, Nils; Neuffer, Julius; Kempa-Liehr, Andreas W.

    Neurocomputing (Amsterdam), 09/2018, Volume: 307
    Journal Article

    Time series feature engineering is a time-consuming process because scientists and engineers have to consider the multifarious algorithms of signal processing and time series analysis for identifying and extracting meaningful features from time series. The Python package tsfresh (Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests) accelerates this process by combining 63 time series characterization methods, which by default compute a total of 794 time series features, with feature selection on basis automatically configured hypothesis tests. By identifying statistically significant time series characteristics in an early stage of the data science process, tsfresh closes feedback loops with domain experts and fosters the development of domain specific features early on. The package implements standard APIs of time series and machine learning libraries (e.g. pandas and scikit-learn) and is designed for both exploratory analyses as well as straightforward integration into operational data science applications.