In our recent work on elastic weight consolidation (EWC) (1), we show that forgetting in neural networks can be alleviated by using a quadratic penalty whose derivation was inspired by Bayesian evidence accumulation. In his letter (2), Dr. Huszár provides an alternative form for this penalty by following the standard...
http://ift.tt/2FzmVzV
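
For readers unfamiliar with the quadratic penalty mentioned in the excerpt, here is a minimal sketch of its commonly cited form for a single previous task: half of a strength `lam` times the sum over parameters of a diagonal Fisher information estimate times the squared distance from the previously learned parameters. The function names, the `lam` parameter, and the use of NumPy are assumptions made for illustration; the sketch does not cover the multi-task case that the letter and reply discuss.

```python
import numpy as np


def ewc_penalty(theta, theta_star, fisher_diag, lam=1.0):
    """Quadratic penalty anchoring theta to theta_star (single previous task).

    theta       -- current parameters (flattened)
    theta_star  -- parameters learned on the previous task
    fisher_diag -- diagonal Fisher information estimate, one value per parameter
    lam         -- penalty strength (hypothetical default)
    """
    theta = np.asarray(theta, dtype=float)
    theta_star = np.asarray(theta_star, dtype=float)
    fisher_diag = np.asarray(fisher_diag, dtype=float)
    # Each parameter's squared deviation is weighted by its Fisher value.
    return 0.5 * lam * np.sum(fisher_diag * (theta - theta_star) ** 2)


def ewc_penalty_grad(theta, theta_star, fisher_diag, lam=1.0):
    """Gradient of ewc_penalty with respect to theta."""
    theta = np.asarray(theta, dtype=float)
    theta_star = np.asarray(theta_star, dtype=float)
    fisher_diag = np.asarray(fisher_diag, dtype=float)
    return lam * fisher_diag * (theta - theta_star)
```

In training, this term would be added to the new task's loss, so parameters with large Fisher values are pulled more strongly toward their old values than parameters with small ones.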