CU•2•CL

Automating CUDA-to-OpenCL Translation

  • If I have seen further it is by standing on the shoulders of giants — Issac Newton

Publications

Sort by: Publication type   Area   Sub Area   Date (papers only)   First author (papers only)  
Jump to: 2021   2020   2019   2018   2017   2016   2015   2014   2013   2012   2011   2010   2009   2008   2007   2006   2005   2004   2003   2002   2001   2000   1999   1998   1997  

  • 2020

  • Exploring FPGA Optimizations in OpenCL for Breadth-First Search on Sparse Graph Datasets.
    Atharva Gondhalekar, Wu-chun Feng.
    In Proceedings of the 30th International Conference on Field-Programmable Logic and Applications, Gothenburg, Sweden, September 2020.
      Paper               Citations:  [ BibTeX    XML    PlainText ]   
  • 2019

  • Adaptive Task Aggregation for High-Performance Sparse Solvers on GPUs.
    Ahmed E. Helal, Ashwin M. Aji, Michael L. Chu, Bradford M. Beckmann, Wu-chun Feng.
    In Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, Seattle, WA, September 2019.
      Paper               Citations:  [ BibTeX    XML    PlainText ]   
  • On the Portability of GPU-Accelerated Applications via Automated Source-to-Source Translation.
    Paul Sathre, Mark Gardner, Wu-chun Feng.
    In Proceedings of the HPC Asia: International Conference on High Performance Computing in Asia-Pacific Region, Guangzhou, China, January 2019.
      Paper               Citations:  [ BibTeX    XML    PlainText ]   
  • 2018

  • Exploring FPGA-specific Optimizations for Irregular OpenCL Applications.
    Mohamed W. Hassan, Ahmed E. Helal, Peter M. Athanas, Wu-chun Feng, Yasser Y. Hanafy.
    In Proceedings of the International Conference on Reconfigurable Computing and FPGAs (ReConFig), Cancun, Mexico, December 2018.
      Paper               Citations:  [ BibTeX    XML    PlainText ]   
  • A Composable Workflow for Productive Heterogeneous Computing on FPGAs via Whole-Program Analysis and Transformation.
    Paul Sathre, Ahmed E. Helal, Wu-chun Feng.
    In Proceedings of the International Conference on Reconfigurable Computing and FPGAs (ReConFig), Cancun, Mexico, December 2018.
      Paper               Citations:  [ BibTeX    XML    PlainText ]   
  • A Framework for Auto-Parallelization and Code Generation: An Integrative Case Study with Legacy FORTRAN Codes.
    Konstantinos Krommydas, Paul Sathre, Ruchira Sasanka, Wu-chun Feng.
    In Proceedings of the 47th International Conference on Parallel Processing (ICPP), Eugene, OR, August 2018.
      Paper               Citations:  [ BibTeX    XML    PlainText ]   
  • CommAnalyzer: Automated Estimation of Communication Cost and Scalability on HPC Clusters from Sequential Code.
    Ahmed E. Helal, Changhee Jung, Wu-chun Feng, Yasser Y. Hanafy.
    In Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing (ACM HPDC 2018), Tempe, Arizona, USA, June 2018.
      Paper               Citations:  [ BibTeX    XML    PlainText ]   
  • 2013

  • Characterizing the Challenges and Evaluating the Efficacy of a CUDA-to-OpenCL Translator.
    Mark Gardner, Paul Sathre, Wu-chun Feng, Gabriel Martinez.
    In Parallel Computing, 39 (12): 769-786, December 2013.
      Preprint               Citations:  [ BibTeX    XML    PlainText ]   
  • 2012

  • Lost in Translation: Challenges in Automating CUDA-to-OpenCL Translation.
    Paul Sathre, Mark Gardner, Wu-chun Feng.
    In Proceedings of the 5th International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2), Pittsburgh, PA, September 2012.
      Paper               Citations:  [ BibTeX    XML    PlainText ]   
  • 2011

  • CU2CL: A CUDA-to-OpenCL Translator for Multi- and Many-Core Architectures.
    Gabriel Martinez, Mark Gardner, Wu-chun Feng.
    In Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, Tainan, Taiwan, December 2011.
      Paper               Citations:  [ BibTeX    XML    PlainText ]   

Sponsors

CHREC AMD DOD
Xilinx Harris

As a thanks for supporting our work, sponsors receive early access to all major releases. If you are interested in becoming a sponsor, please contact us.

Recent News

CU2CL 0.8.0b Released

03/21/17: We are pleased to announce the release of the 0.8.0b version of CU2CL. The whole program translation architecture debuted in CU2CL 0.7.0b has been expanded to include our first cross-AST translation and type propagations. A binary tarball is available with registration here and the full source is available on GitHub.

CU2CL at SC'16 Emerging Technologies Showcase

09/27/16: Sathre, Gardner and Feng have been selected to demonstrate CU2CL's effectiveness during the Emerging Technologies Showcase at Supercomputing'16 in Salt Lake City.

More publications ...

CU2CL Releases

Source Release 0.8.0b

Latest update (03/21/17)

Binary Release 0.8.0b

Latest update (03/21/17)

CU2CL License

Read | Download