Page 286 - 2024-Vol20-Issue2
P. 286

282 |                                                            Alrudainy, Marzook, Hussein & Shafik

 [2] H. Esmaeilzadeh, E. Blem, R. S. Amant, K. Sankar-                 ment of concurrent workloads for energy-efficient multi-
      alingam, and D. Burger, “Dark silicon and the end of             core systems,” Journal of Low Power Electronics and
      multicore scaling,” IEEE micro, vol. 32, no. 3, pp. 122–         Applications, vol. 10, no. 3, p. 25, 2020.
      134, 2012.
                                                                 [12] S. Tzilis, P. Trancoso, and I. Sourdis, “Energy-efficient
 [3] A. Shafaei Bejestan, Y. Wang, S. Ramadurgam, Y. Xue,              runtime management of heterogeneous multicores using
      P. Bogdan, and M. Pedram, “Analyzing the dark silicon            online projection,” ACM Transactions on Architecture
      phenomenon in a many-core chip multi-processor under             and Code Optimization (TACO), vol. 15, no. 4, pp. 1–26,
      deeply-scaled process technologies,” in Proceedings of           2019.
      the 25th edition on Great Lakes Symposium on VLSI,
      pp. 127–132, 2015.                                         [13] A. K. Singh, A. Prakash, K. R. Basireddy, G. V. Mer-
                                                                       rett, and B. M. Al-Hashimi, “Energy-efficient run-time
 [4] J. Henkel, H. Bukhari, S. Garg, M. U. K. Khan, H. Khdr,           mapping and thread partitioning of concurrent opencl
      F. Kriebel, U. Ogras, S. Parameswaran, and M. Shafique,          applications on cpu-gpu mpsocs,” ACM Transactions on
      “Dark silicon: From computation to communication,”               Embedded Computing Systems (TECS), vol. 16, no. 5s,
      in Proceedings of the 9th International Symposium on             pp. 1–22, 2017.
      Networks-on-Chip, pp. 1–8, 2015.
                                                                 [14] C. Hankendi and A. K. Coskun, “Adaptive power and re-
 [5] X. Wang, A. K. Singh, B. Li, Y. Yang, H. Li, and T. Mak,          source management techniques for multi-threaded work-
      “Bubble budgeting: Throughput optimization for dy-               loads,” in 2013 IEEE International Symposium on Paral-
      namic workloads by exploiting dark cores in many core            lel & Distributed Processing, Workshops and Phd Forum,
      systems,” IEEE Transactions on Computers, vol. 67,               pp. 2302–2305, IEEE, 2013.
      no. 2, pp. 178–192, 2017.
                                                                 [15] R. A. Shafik, S. Yang, A. Das, L. A. Maeda-Nunez,
 [6] X. Wang, B. Zhao, L. Wang, T. Mak, M. Yang, Y. Jiang,             G. V. Merrett, and B. M. Al-Hashimi, “Learning transfer-
      and M. Daneshtalab, “A pareto-optimal runtime power              based adaptive energy minimization in embedded sys-
      budgeting scheme for many-core systems,” Micropro-               tems,” IEEE Transactions on Computer-Aided Design of
      cessors and Microsystems, vol. 46, pp. 136–148, 2016.            Integrated Circuits and Systems, vol. 35, no. 6, pp. 877–
                                                                       890, 2015.
 [7] E. Musoll, “Hardware-based load balancing for mas-
      sive multicore architectures implementing power gating,”   [16] A. Das, A. Kumar, B. Veeravalli, R. Shafik, G. Mer-
      IEEE Transactions on Computer-Aided Design of Inte-              rett, and B. Al-Hashimi, “Workload uncertainty char-
      grated Circuits and Systems, vol. 29, no. 3, pp. 493–497,        acterization and adaptive frequency scaling for energy
      2010.                                                            minimization of embedded systems,” in 2015 Design,
                                                                       Automation & Test in Europe Conference & Exhibition
 [8] J. N. Mistry, B. M. Al-Hashimi, D. Flynn, and S. Hill,            (DATE), pp. 43–48, IEEE, 2015.
      “Sub-clock power-gating technique for minimising leak-
      age power during active mode,” in 2011 Design, Automa-     [17] B. K. Reddy, M. J. Walker, D. Balsamo, S. Diestelhorst,
      tion & Test in Europe, pp. 1–6, IEEE, 2011.                      B. M. Al-Hashimi, and G. V. Merrett, “Empirical cpu
                                                                       power modelling and estimation in the gem5 simulator,”
 [9] J. Charles, P. Jassi, N. S. Ananth, A. Sadat, and A. Fe-          in 2017 27th International Symposium on Power and
      dorova, “Evaluation of the intel® core™ i7 turbo boost           Timing Modeling, Optimization and Simulation (PAT-
      feature,” in 2009 IEEE International Symposium on                MOS), pp. 1–8, IEEE, 2017.
      Workload Characterization (IISWC), pp. 188–197, IEEE,
      2009.                                                      [18] A. Aalsaud, A. Rafiev, F. Xia, R. Shafik, and
                                                                       A. Yakovlev, “Model-free runtime management of con-
[10] K. Ma and X. Wang, “Pgcapping: Exploiting power                   current workloads for energy-efficient many-core het-
      gating for power capping and core lifetime balancing             erogeneous systems,” in 2018 28th International Sympo-
      in cmps,” in Proceedings of the 21st international con-          sium on Power and Timing Modeling, Optimization and
      ference on Parallel architectures and compilation tech-          Simulation (PATMOS), pp. 206–213, IEEE, 2018.
      niques, pp. 13–22, 2012.

[11] A. Aalsaud, F. Xia, A. Rafiev, R. Shafik, A. Romanovsky, [19] A. Aalsaud, R. Shafik, A. Rafiev, F. Xia, S. Yang, and

and A. Yakovlev, “Low-complexity run-time manage-                A. Yakovlev, “Power–aware performance adaptation of
   281   282   283   284   285   286   287   288   289   290   291