Advances in Software Engineering
Volume 2015, Article ID 940628, 14 pages
http://dx.doi.org/10.1155/2015/940628
Research Article

LTTng CLUST: A System-Wide Unified CPU and GPU Tracing Tool for OpenCL Applications

Department of Computer and Software Engineering, Polytechnique Montreal, P.O. Box 6079, Station Downtown, Montreal, QC, Canada H3C 3A7

Received 14 April 2015; Accepted 1 July 2015

Academic Editor: Moreno Marzolla

Copyright © 2015 David Couturier and Michel R. Dagenais. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. B. Gregg and J. Mauro, DTrace: Dynamic Tracing in Oracle Solaris, Mac OS X, and FreeBSD, vol. 26, Book News, 2011, http://search.proquest.com/docview/869984396.
  2. S. E. Fagan, “Tracing BSD system calls,” Dr. Dobb's Journal, vol. 23, no. 3, p. 38, 1998, http://search.proquest.com/docview/202719549.
  3. W. C. Don Domingo, SystemTap 2.7, SystemTap Beginners Guide: Introduction to SystemTap, 2013.
  4. M. Desnoyers, Low-impact operating system tracing [Ph.D. thesis], École Polytechnique de Montréal, 2009.
  5. OpenCL—the open standard for parallel programming of heterogeneous systems, 2015, https://www.khronos.org/opencl/.
  6. Strace project: strace(1) Linux Manual Pages, 2010, http://man7.org/linux/man-pages/man1/strace.1.html.
  7. M. Desnoyers and M. R. Dagenais, “Lockless multi-core high-throughput buffering scheme for kernel tracing,” ACM SIGOPS Operating Systems Review, vol. 46, no. 3, pp. 65–81, 2012.
  8. B. Gregg, “Strace wow much syscall,” 2014, http://www.brendangregg.com/blog/2014-05-11/strace-wow-much-syscall.html.
  9. B. Gregg, Tracing Summit 2014: From DTrace To Linux, Brendan Gregg (Netflix), 2014.
  10. D. Goulet, Unified kernel/user-space efficient Linux tracing architecture [M.S. thesis], École Polytechnique de Montréal, Montréal, Canada, 2012.
  11. H. Waly and B. Ktari, “A complete framework for kernel trace analysis,” in Proceedings of the 24th Canadian Conference on Electrical and Computer Engineering (CCECE '11), pp. 001426–001430, May 2011.
  12. N. Ezzati-Jivan and M. R. Dagenais, “A stateful approach to generate synthetic events from kernel traces,” Advances in Software Engineering, vol. 2012, Article ID 140368, 12 pages, 2012.
  13. A. Palnitkar, P. Saggurti, and S.-H. Kuang, “Finite state machine trace analysis program,” in Proceedings of the International Verilog HDL Conference, pp. 52–57, IEEE, Santa Clara, Calif, USA, March 1994.
  14. N. Ezzati-Jivan and M. R. Dagenais, “A framework to compute statistics of system parameters from very large trace files,” ACM SIGOPS Operating Systems Review, vol. 47, no. 1, pp. 43–54, 2013.
  15. N. Ezzati-Jivan and M. R. Dagenais, “Cube data model for multilevel statistics computation of live execution traces,” Concurrency and Computation: Practice and Experience, vol. 27, no. 5, pp. 1069–1091, 2015.
  16. R. Dietrich, F. Schmitt, R. Widera, and M. Bussmann, “Phase-based profiling in GPGPU kernels,” in Proceedings of the 41st International Conference on Parallel Processing Workshops (ICPPW '12), pp. 414–423, September 2012.
  17. P. Mistry, C. Gregg, N. Rubin, D. Kaeli, and K. Hazelwood, “Analyzing program flow within a many-kernel OpenCL application,” in Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-4, pp. 10:1–10:8, ACM, New York, NY, USA, 2011.
  18. J. D. Owens, M. Houston, D. Luebke, S. Green, J. E. Stone, and J. C. Phillips, “GPU computing,” Proceedings of the IEEE, vol. 96, no. 5, pp. 879–899, 2008.
  19. G. Juckeland, “Trace-based performance analysis for hardware accelerators,” in Tools for High Performance Computing 2011, H. Brunst, M. S. Müller, W. E. Nagel, and M. M. Resch, Eds., pp. 93–104, Springer, Berlin, Germany, 2012.
  20. Khronos Group, OpenCL Reference Pages, Khronos Group, 2011, https://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/.
  21. B. Poirier, R. Roy, and M. Dagenais, “Accurate offline synchronization of distributed traces using kernel-level events,” ACM SIGOPS Operating Systems Review, vol. 44, no. 3, pp. 75–87, 2010.
  22. M. Jabbarifar, “On line trace synchronization for large scale distributed systems,” ProQuest, UMI Dissertations Publishing, 2013, http://search.proquest.com/docview/1561560787.