adaptive_io.py (1 line):
- line 111: proj_hid = F.linear(hidden, proj.t().contiguous())  # TODO: .contiguous() not necessary?
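
One way to answer the TODO is to compare the output of F.linear with and without the .contiguous() call. The sketch below is a minimal check under assumed conditions: the tensor shapes are hypothetical (the real shapes in adaptive_io.py are not shown here), it runs on CPU with float32, and it only tests numerical equivalence, not speed. F.linear accepts a non-contiguous weight, so the transposed view should give the same result; whether the extra copy helps or hurts performance would need separate profiling on the target device.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

# Hypothetical shapes, chosen only for illustration:
# hidden is (batch, d_in), proj is (d_in, d_out).
hidden = torch.randn(4, 8)
proj = torch.randn(8, 16)

weight_view = proj.t()               # transposed view, shares storage with proj
weight_copy = proj.t().contiguous()  # separate copy laid out contiguously

# F.linear(x, w) computes x @ w.T, so both calls reduce to hidden @ proj.
out_view = F.linear(hidden, weight_view)
out_copy = F.linear(hidden, weight_copy)

view_is_contiguous = weight_view.is_contiguous()
same_result = torch.allclose(out_view, out_copy, atol=1e-5)
matches_matmul = torch.allclose(out_view, hidden @ proj, atol=1e-5)

print("view contiguous:", view_is_contiguous)
print("view and copy agree:", same_result)
print("agrees with hidden @ proj:", matches_matmul)
```

If the results agree for the shapes and dtypes actually used on line 111, the call could probably be reduced to F.linear(hidden, proj.t()), or written as hidden @ proj, though that change should be confirmed against the surrounding code before removing the TODO.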