Studying fine tuning and multitask learning in an angle discrimination task

Highlights:

Develops multiple ways to measure Fisher information neural networks
- We observe very 'rough' Fisher information measurements, in the sense that the can change quickly with small changes in inputs
Fisher information iterations show increasing randomness, due to diffusion, without compensating adaptation on small scales.
- There are also strong effects of the pre-training in the Fisher information, which show up as small-scale correlations between different fine-tune replicates.
Models do specialize to regions of the domain. However, they are often able to extrapolate well outside of the concentrated regions, and the scale of the difference between regions is similar to the noise between replicates, and to the variability of the Fisher information regions.
- This concentration does effect later fine-tuning. The results appear reproducible, but again fairly small compared to the variation between initializations.
- Potentially effects from loss functions, but this is hard to assess.
Many choices about the models have an effect on the observed patterns:
- Smoother (differentiable) non-linearities result in smoother Fisher information.
- Decoding method impacts Fisher information distribution: angular encodings result in multiple-peaked Fisher information.
- Train-set vs test-set images produce different distributions of encoded values, due to generalization.
Conditioning on these specific choices, we do see loss-function dependent concentration of Fisher information.
- Concentration does happen, but the way that it does does not map very cleanly onto the theory.
- The additional noise from generalization makes this much less clean for test than for training data.
Iterative approaches remain troublesome in terms of convergence.

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
datageneration		datageneration
experiments		experiments
writeups		writeups
.gitignore		.gitignore
Results0.1-InitialResults.ipynb		Results0.1-InitialResults.ipynb
Results0.2-finetuning.ipynb		Results0.2-finetuning.ipynb
Results1.0-FisherInformation.ipynb		Results1.0-FisherInformation.ipynb
Results1.1-FisherInfo2.ipynb		Results1.1-FisherInfo2.ipynb
Results1.2-FisherInfor3.ipynb		Results1.2-FisherInfor3.ipynb
Results1.3-TrainingResults.ipynb		Results1.3-TrainingResults.ipynb
Results1.4-nextPdf.ipynb		Results1.4-nextPdf.ipynb
Results2.0-objective-effects.ipynb		Results2.0-objective-effects.ipynb
Results2.1-retraining.ipynb		Results2.1-retraining.ipynb
Results2.2-backprop_dynamics.ipynb		Results2.2-backprop_dynamics.ipynb
Results2.3-simple_tests_more_exploration.ipynb		Results2.3-simple_tests_more_exploration.ipynb
Results2.4-experiment_analysis.ipynb		Results2.4-experiment_analysis.ipynb
Results2.5-interation_approaches.ipynb		Results2.5-interation_approaches.ipynb
Results3.0-return.ipynb		Results3.0-return.ipynb
adapt_fit_loop.py		adapt_fit_loop.py
adaptableModel.py		adaptableModel.py
basicModel.py		basicModel.py
discriminationAnalysis.py		discriminationAnalysis.py
example_codes.py		example_codes.py
nonlinFisher.py		nonlinFisher.py
old_solutions_analytic.py		old_solutions_analytic.py
readme.md		readme.md
rotatedFaces.py		rotatedFaces.py
scratch2.2-FIspeedup.ipynb		scratch2.2-FIspeedup.ipynb
trainers.py		trainers.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Studying fine tuning and multitask learning in an angle discrimination task

About

Uh oh!

Releases

Packages

Languages

lrast/angleFinetuning

Folders and files

Latest commit

History

Repository files navigation

Studying fine tuning and multitask learning in an angle discrimination task

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages