• OhNoMoreLemmy@lemmy.ml
    link
    fedilink
    English
    arrow-up
    1
    ·
    3 days ago

    In practice it’s very systematic for small networks. You perform a search over a range of values until you find what works. We know the optimisation gets harder the deeper a network is so you probably won’t go over 3 hidden layers on tabular data (although if you really care about performance on tabular data you would use something that wasn’t a neural network).

    But yes, fundamentally, it’s arbitrary. For each dataset a different architecture might work better, and no one has a good strategy for picking it.

    • Poik@pawb.social
      link
      fedilink
      English
      arrow-up
      2
      ·
      3 days ago

      There are ways to estimate a little more accurately, but the amount of fine tuning that is guesswork and brute force searching is too damn high…