make hydra instantiation simpler
We currently have a complicated initialization process, with a lot happening in hydra.py The issue is that we need the number of channels to be able to instantiate the model. This is the historical reason for making a lot of instantiation steps manually (and using model_partial, ...) I see a way to get rid of these manual steps by first instantiating the tokenization/embedding pipelines and passing a cached sample, then updating the config to perform an automatic instantiation. This would result in: * simpler config files * simpler hydra.py * no config_partial, model_partial, ...
issue