daily_log.md 2.7 KB

Daily Log

Friday, June 14, 2024

Relativly sedate day, mostly just rewriting the dataset system to ensure that loaded datasets use in evaluation are the same as the ones used in training. This ensures that there is no leakage of the train set into the validation and testing sets. Plan to meet with Ali and Brayden on Tuesday to go over results of the coverage analysis, since our results do not indicate the expected negative relationship between coverage and f1/accuracy/AUC.

Progress

  • Rewrote training to save datasets used - changed model directory structure to accomidate
  • Rewrote dataset loading in evaluation methods to ensure that the same datasets are used as in training
  • Trained new 10 epoch 30 model ensemble and evaluated
    • Found better results than in previous ensembles, mostly negative relationship established, but still seeing sharp drop at very low coverage

Future

  • Investigate reasons for low accuracy at high certainty
    • Do careful reading of papers on coverage analysis
    • Compare to other models maybe??
    • Meet with Ali and Brayden about results, brainstorm reasons
  • Continue reading more papers on uncertainty
  • Investigate BNNs further

Monday, June 17, 2024

First day of Week 3! Pretty slow today as well, mainly working on ECE and figuring out why the coverage curves are so weird. Reading more about ensemble methods, going to try some calibration techniques. Overall, though, pretty good! Hope to be able to do some more after meeting with Ali and maybe Brayden, potentially look at some training methods (bagging, boosting etc.) and continue to investigate the weird coverage metrics.

Progress

  • Implemented ECE metric
  • Continued reading about uncertainty quantification
  • Looked into bagging/boosting implementations

Future

  • Continue to investigate low accuracy at high certainty
  • Continue reading
  • Meet with Ali Wednesday

Tuesday, June 18, 2024

Slow day today, mostly continued with reading. Began training a new model with 100x30 runs/epochs, should be able to serve as base model for future work now that the data is deterministic

Progress

  • Continued to read about calibration and calibration errors
  • Investigated libraries for ensemble tranining

Future

  • Meet with Ali
  • Continue reading

Wednesday, June 19, 2024

Work from home day, mostly planning and reading. Met with Ali and Brayden, discussed progress and began initial thinking towards future paths for project. Main focus on improving our uncertainty z

Progress

  • Met with Ali and Brayden
  • Began thinking about future progress
  • Worked on ensemble a bit

Future

  • Fix uncertainty metric
  • Evaluate performance of different metrics
  • Write research proposal