This is a bonus report for me. In this homework, I work on accelerate matrix calculation with `collapse` command; however, naive approaches will incur race condition. Thus, guarantees of the independent calculation of each block is very crucial in this task.