Gotchas

  • Do not power transform your data before feeding into TabPFN.
  • “We do not have special nan handling built into our model. We replace nan values with zero at test time”

Limitations

  • “We also focused the development of TabPFN to purely numerical datasets without missing values, and while they can be applied to datasets with categorical features and/or missing values, their performance is generally worse.”
  • “we did not consider the existence of many uninformative features in our prior, leading to performance degradation when such features are added”
  • since TabPFN does training AND inference at the same time, inference is slower
  • it works much better for categorization than regression