Papers

Listed below are publications organized by year.


To Appear

  • Y. Chen, A. Khadem, X. He, N. Talati, T. Khan, and T. Mudge. PEDAL: A Power Efficient GCN Accelerator with Multiple DAtafLows. Design, Automation and Test in Europe (DATE), Anterwerp, Belgium, April 2023.

2023

  • K. Flautner, N. Kim, S. Martin, D. Blaauw, T. Mudge. RETROSPECTIVE: Drowsy Caches: Simple techniques for reducing leakage power. ISCA@50 25-Year Retrospective: 1996-2020. June 2023, ACM SIGARCH & IEEE TCCA, https://bit.ly/isca50_retrospective. 
  • V. Cuppu, B. Jacob, B. Davis, T. Mudge. RETROSPECTIVE: A performance comparison of contemporary DRAM architectures. ISCA@50 25-Year Retrospective: 1996-2020. June 2023, ACM SIGARCH & IEEE TCCA, https://bit.ly/isca50_retrospective. 
  • H-W. Kim, H. Ye, T. Mudge, R. Dreslinski, N. Talati. RecPIM: A PIM-enabled DRAM- RRAM hybrid memory system for recommendation models.  2023 Int. Symp. on Low Power Electronics Design (ISLPED), Vienna, Austria, August 2023, pp. 1-6, doi: 10.1109/ISLPED58423.2023.10244420.
  • Y. Chen, A. Khadem, X. He, N. Talati, T. Khan, and T. Mudge. PEDAL: A Power Efficient GCN Accelerator with Multiple DAtafLows. Design, Automation and Test in Europe (DATE), Anterwerp, Belgium, April 2023, pp. 1-6, doi: 10.23919/DATE56975.2023.10137240
  • H. Ye, S. Vedula, Y. Chen, Y. Yang, A. Bronstein, R. Dreslinski, T. Mudge, N. Talati. GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference. 23rd Int. Conf. on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Vancouver, Canada, March 2023, pp. 282-301. https://doi.org/10.1145/3582016.3582029 [pdf]
  • U. Ogras, R. Marculescu, T. Mudge, M. Kishinevsky. Introduction to the Special Issue on Domain-Specific System-on-Chip Architectures and Run-Time Management Techniques. ACM Trans. on Embedded Computing Systems, vol. 22, no. 2, 2023, pp. 1-3. doi.acm.org?doi=3567834 [pdf]
  • A. Krishnakumar, U. Ogras, R. Marculescu, M. Kishivensky, T. Mudge. Domain-Specific Architectures (DSAs): Research Problems and Promising Approaches. ACM Trans. on Embedded Computing Systems, vol. 22, no. 2, 2023, pp. 28:1-28:26 https://doi.org/10.1145/3563946 [pdf]

2022

  • N. Talati, H. Ye, S. Vedulah, K-Y. Chen, Y. Chen, D. Liu, D. Blaauw, A. Bronstein, T. Mudge, R. Dreslinski. Mint: An Accelerator For Mining Temporal Motifs. The 55th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), Chicago, IL, October 2022, pp. 1270-1287. [pdf]
  • X. He , K-Y. Chen,  S. Feng, H.-S. Kim, D. Blaauw, R. Dreslinski,  T. Mudge. Squaring the circle: Executing Sparse Matrix Computations  on FlexTPU—a TPU-like processor. 31st Int. Conf. on Parallel Architectures and Compilation Techniques (PACT), Chicago, IL, October 2022, pp.148-159. https://doi.org/10.1145/3559009.3569665 [pdf]
  •  L. Belayneh, H. Ye, K-Y. Chen,  D. Blaauw, T. Mudge, R. Dreslinski, N. Talati.  Locality-aware Optimizations for Improving Remote Memory Latency in Multi-GPU Systems. 31st Int. Conf. on Parallel Architectures and Compilation Techniques (PACT), Chicago, IL, October 2022, pp. 304-316. https://doi.org/10.1145/3559009.3569649 [pdf]
  • B. Oh, N. Abeyratne, N. S. Kim, J. Ahn, R. G. Dreslinski and T. Mudge. Rethinking DRAM’s page mode with STT-MRAM. IEEE Transactions on Computers, September 2022, pp. 1-14. https://doi.org/10.1109/TC.2022.3207131 [pdf]
  • Y. Xiong, J. Li, D. Blaauw, H.-S. Kim, T. Mudge, R. Dreslinski, and C. Chakrabarti. Improving Energy Efficiency of Convolutional Neural Networks on Multi-core Architectures through Run-time Reconfiguration. 2022 IEEE Int. Symp. on Circuits and Systems (ISCAS), Austin TX,  June 2022, pp. 375-379. [pdf]
  • S. Feng, X. He, K-Y. Chen, L. Ke, X. Zhang, D. Blaauw, T. Mudge, and R. Dreslinski. MeNDA: A Near-Memory Multi-way Merge Solution for Sparse Transposition and Dataflows. Proc. 49th Ann. Int. Symp. on Computer Architecture (ISCA), NY, June 2022, pp. 245-258. https://doi.org/10.1145/3470496.3527432 [pdf]
  • N. Talati, H. Ye, Y. Yang, L. Belayneh, K-Y. Chen, D. Blaauw, T. Mudge, R. Dreslinski. NDMiner: Accelerating Graph Pattern Mining Using Near Data Processing. Proc. 49th Ann. Int. Symp. on Computer Architecture (ISCA), NY, June 2022pp. 146-159. https://doi.org/10.1145/3470496.3527437 [pdf]
  • K.-Y. Chen, C.-S. Yang, Y.-H. Sun, C.-W. Tseng, M. Fayazi, X. He, S. Feng,Y. Yue, T. Mudge, R. Dreslinski, H.-S. Kim, D. Blaauw. A 507 GMACs/J 256-Core Domain Adaptive Systolic-Array-Processor for Wireless Communication and Linear-Algebra Kernels in 12nm FINFET2022 Symposia on VLSI Technology and Circuits (VLSI), Honolulu, HI, June 2022, pp. 202-203. DOI: 10.1109/VLSITechnologyandCir46769.2022.9830330 [pdf]
  • S. Kim, M. Fayazi, A. Daftardar, K-Y. Chen, J. Tan, S. Pal, T. Ajayi, Y. Xiong, T. Mudge, C. Chakrabarti, D. Blaauw, R. Dreslinski, H-S. Kim. Versa: A 36-Core Systolic Multiprocessor with Dynamically-Reconfigurable Interconnect and Memory. IEEE Jour. Solid-State Circuits, vol. 57, no. 4, April 2022, pp. 986-998. [pdf]

2021

  • N. Talati, D. Jin, H. Ye, A. Brahmakshatriya, S. Amarasinghe, T. Mudge, D. Koutra, R. Dreslinski. A Deep Dive Into Understanding The Random Walk-Based Temporal Graph Learning. 2021 IEEE International Symposium on Workload Characterization (IISWC), November 2021. [virtual][pdf]
  • A. Khadem, H. Ye, and T. Mudge. CoDR: Computation and Data Reuse Aware CNN Accelerator. Poster at the Design Automation Conference, San Francisco, July 2021. [poster][arXiv:2104.09798 pdf]
  • S. Feng, J.Suny, S. Pal, X. He, K. Kaszyk, D-H, Park, M. Mortony, T. Mudge, M. Coley, M. O’Boyle, C. Chakrabarti, and R. Dreslinski. CoSPARSE: A Software and Hardware Reconfigurable SpMVFramework for Graph Analytics. Design Automation Conference, San Francisco, July 2021. [pdf]
  • S. Kim, M. Fayazi, A. Daftardar, K-Y. Chen, J. Tan, S. Pal, T. Ajayi1, Y. Xiong, T. Mudge, C. Chakrabarti, D. Blaauw, R. Dreslinski, H-S. Kim. Versa: A Dataflow-Centric Multiprocessor with 36 Systolic ARM Cortex-M4F Cores and a Reconfigurable Crossbar-Memory Hierarchy in 28nm. 2021 Symposia on VLSI Technology and Circuits, Kyoto, June 2021. [virtual] [pdf] [talk]
  • N. Talati, K. May, A. Behroozi, Y. Yang, K. Kaszyk, C. Vasiliadiotis, T. Verma, L.Li, B. Nguyen, J. Sun, J. Morton, A. Ahmadi, T. Austin, M. O’Boyle, S. Mahlke, T. Mudge, and R. Dreslinski. Prodigy: Improving the Memory Latency of Data-Indirect Irregular Workloads Using Hardware-Software Co-Design. Symp. on High Performance Computer Architecture (HPCA), Seoul, S. Korea, February 2021. [Best Paper Award] [virtual][pdf]

2020

  • Y. Xiong, J. Zhou, S. Pal, D. Blaauw, H.-S. Kim, T. Mudge, R. Dreslinski, C. Chakrabarti. Accelerating Deep Neural Network Computation on a Low Power Reconfigurable Architecture.   IEEE Int. Symp. on Circuits and Systems (ISCAS), Seville, Spain, October 2020, pp. 1-5. [virtual] [pdf]
  • S. Pal, S. Feng, D.-H. Park, S. Kim, A. Amarnath, C.-S. Yang, X. He, J. Beaumont, K. May, Y. Xiong, Kuba Kaszyk, J. M. Morton, J. Sun, M. O’Boyle, M. Cole, C. Chakrabarti, D. Blaauw, H.-S. Kim, T. Mudge, R. DreslinskiTransmuter: Bridging the Efficiency Gap using Memory and Dataflow Reconfiguration.  9th Conf. on Parallel Architectures and Compilation Techniques (PACT), Atlanta, October 2020. [virtual] [pdf]
  • S. Pal, K. Kaszyk, S. Feng, B. Franke, M. Cole, M. O’Boyle, T. Mudge, R. G. Dreslinski. HetSim: Simulating Large-Scale Heterogeneous Systems using a Trace-driven, Synchronization and Dependency-Aware Framework.. IEEE Int. Symp. on Workload Characterization (IISWC-2020), Beijing, China, October 2020, pp. 13-24. [virtual] [pdf]
  • Yang, H. Ye, Y. Chen, X. Liu, N. Talati, X. He, T. Mudge, and R. Dreslinski. CoPTA: Contiguous Pattern Speculating TLB Architecture. Twentieth Int. Conf. on Embedded Computer Systems: Architecture, Modeling and Simulation (SAMOS XX), Samos, Greece, July 2020.[virtual] [pdf][talk]
  • X. He, S. Pal, A. Amarnath, S. Feng, D-.H. Park, A. Rovinski, H. Ye, Y. Chen, R. Dreslinski, T. Mudge. Sparse-TPU: Adapting Systolic Arrays for Sparse Matrices.  34th ACM Int. Conf. on Supercomputing (ICS), Barcelona, Spain, June 2020, pp. 1-12. [virtual] [pdf][talk]
  • A. Soorishetty, J. Zhou, S. Pal, D. Blaauw, H.-S. Kim, T. Mudge, R. Dreslinski, C. Chakrabarti. Accelerating Linear Algebra Kernels On A Massively Parallel Reconfigurable Architecture. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, May 2020, pp. 1558-1562. [virtual] [pdf][talk]
  • D.-H. Park, S. Pal, S. Feng, P. Gao, J. Tan, A. Rovinski, S. Xie, C. Zhao, A. Amarnath, T. Wesley, J. Beaumont, K.-Y. Chen, C. Chakrabarti, M. Taylor, T. Mudge, D. Blaauw, H.-S. Kim, R. Dreslinski. A 7.3 M Output Non-Zeros/J, 11.7 M Output Non-Zeros/GB Reconfigurable Sparse Matrix-Matrix Multiplication Accelerator. IEEE Jour. Solid-State Circuits, vol. 55, no. 4, April 2020, pp. 933-944. [pdf]

2019

  • J. Beaumont, T. Mudge. Fine-Grained Management of Thread Blocks for Iregular Applications. IEEE 37th Int. Conf. on Computer Design (ICCD). IEEE 37th Int. Conf. on Computer Design (ICCD), Abu Dhabi, November 2019, pp. 283-292. [pdf]
  • B. Oh, N. Abeyratne, N. S. Kim, R. Dreslinski and T. Mudge.  SMART: STT-MRAM Architecture for Smart Activation and Sensing. MEMSYS, Washington DC, October 2019, pp. 316-330. [pdf]
  • S. Pal, D.-H. Park, S. Feng, P. Gao, J. Tan, A. Rovinski, S. Xie, C. Zhao, A. Amarnath, T. Wesley, J. Beaumont, K.-Y. Chen, C. Chakrabarti, M. Taylor, T. Mudge, D. Blaauw, H.-S. Kim, R. Dreslinski. A 7.3 M Output Non-Zeros/J Sparse Matrix-Matrix Multiplication Accelerator using Memory Reconfiguration in 40 nm. 2019 Symposia on VLSI Technology and Circuits, Kyoto Japan, June 2019. [pdf]
  • H-M. Chen, S-Y. Lee, C-J. Wu, T. Mudge, C. Chakrabarti. Configurable-ECC: Architecting a Flexible ECC Scheme to Support Different Sized Accesses in High Bandwidth Memory Systems. IEEE Transactions on Computers,vol. 86, no. 9, May 2019,  pp. 646-658. [pdf]

2018

  • S. Pal, J. Beaumont, D-H. Park, A. Amarnath, S. Feng, C. Chakrabarti, H-S. Kim, D. Blaauw, T. Mudge, R. Dreslinski. OuterSPACE: An Outer Product based Sparse Matrix Multiplication Accelerator. Int. Symp. on High Performance Computer Architecture (HPCA),Vienna, austria, February 2018, pp. 724-736. [pdf]
  • B. Oh, N. S. Kim, J. Ahn, B. Li, R. Dreslinski and T. Mudge.  A Load Balancing for Memory Channels. MEMSYS, Washington DC, October 2018, pp. 53-66. [pdf]

2017

  • D-H. Park, J Beaumont, T. Mudge. Accelerating Smith-Waterman Alignment workload with Scalable Vector Computing. REV-A Workshop at IEEE Cluster, Hawaii, September 2017, pp. 8. [pdf]
  • Y. Chen, S. Lu, C. Fu, D. Blaauw, R. Dreslinski, H.-S. Kim, T. Mudge. A Programmable Galois Field Processor for the Internet of Things. Proc. 44th Int. Symp. on Computer Architecture (ISCA), Toronto, Canada, June 2017, pp. 55-68. [pdf]
  • Y. Kang, J. Hauswald, C. Gao, A. Rovinski, T. Mudge, J. Mars, L. Tang. Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge. Proc. 22nd Int. Conf. on Architectural Support for Programming Languages and Operating Systems (ASPLOS). Xi’an, China, April 2017, pp. 615-629. [pdf]
  • N. Pinckney, L. Shifren, B. Cline, S. Sinha, S. Jeloka, R. Dreslinski, T. Mudge, D. Sylvester, D. Blaauw. Impact of FinFET on Near-Threshold Voltage Scalability. IEEE Design & Test, March/April 2017, pp. 31-38. [pdf]
  • S. Bang, J. Wang, Z. Li, C. Gao, Y. Kim, Q. Dong, Y.P. Chen, L. Fick, X. Sun, R.G. Dreslinski, T. Mudge, H-S. Kim, D. Blaauw, D. Sylvester. A 288uW Programmable Deep Learning Processor with 270kB On-chip Weight Storage Using Non-uniform Memory Hierarchy for Mobile Intelligence. 2017 Int. Solid-State Circuits Conference (ISSCC), San Francisco, CA, February 2017, pp 250-251.[pdf]

2016

  • H-M. Chen, S. Jeloka, A. Arunkumar, D. Blaauw, C-J. Wu, T. Mudge, C. Chakrabarti. Using Low Cost Erasure and Error Correction Schemes to Improve Reliability of Commodity DRAM Systems. IEEE Transactions on Computers, December 2016, vol. 65, no. 12, pp. 3766-3779. [pdf]
  • Y. Chen, N. Chiotellis, Li.-X. Chuo, C. Pfeiffer, Y. Shi, R. Dreslinski, A. Grbic, T. Mudge, David D. Wentzloff, D. Blaauw, and H.-S. Kim. Energy-Autonomous Wireless Communication for Millimeter-Scale Internet-of-Things Sensor Nodes. IEEE Journal on Selected Areas in Communications, December 2016, vol. 34, no. 12, pp. 3962-3977. [pdf]
  • N. Abeyratne, H.-M. Chen, B. Oh, R. Dreslinski, C. Chakrabarti, T. Mudge. Checkpointing Exascale Memory Systems with Existing Memory Technologies. MEMSYS, Washington DC, October 2016, pp. 18-28. [pdf]
  • H-M. Chen, C-J. Wu, T. Mudge, C. Chakrabarti. RATT-ECC: Rate Adaptive Two-Tiered Error Correction Codes for Reliable 3D Die-Stacked Memory. ACM Transactions on Architecture and Code Optimization (TACO), 16, issue 3, September 2016, article 24, 24 pp. [pdf]
  • T. Mudge, R. Sendag, F. Chong, J. Yi, I. Markov, D. Chiou. Impact of Future Technologies on Architecture. IEEE MICRO, July/August 2016, pp. 48-56. [pdf]
  • Q. Zheng, Y. Chen, S. Abeyratne, R. Dreslinski, and T. Mudge. Designing General Purpose Cloud Platforms for Future Radio Access Networks. CTN: IEEE CompSoc Technology News http://www.comsoc.org/ctn. July 2016. [pdf]
  • N. Pinckney, B. Cline, R.G. Dreslinski, S. Jeloka, L. Shifrin, S. Sinha, T. Mudge, D. Sylvester, D. Blaauw. Near-Threshold Computing in FinFET Technologies: Opportunities for Improved Voltage Scalability. 53rd Design Automation Conference (DAC), Austin, TX, June 2016, pp. 1-6. [pdf]
  • J. Hauswald, M. A. Laurenzano, Y. Zhang, C. Li, A. Rovinski, A. Khurana, R.G. Dreslinski, T. Mudge, V. Petrucci, L. Tang, J. Mars. Sirius Implications For Future Warehouse-scale Computers. IEEE MICRO, May/June 2016, vol. 36, no. 3, pp. 42-53. [Top Pick: selected as one of the 12 best papers in computer architecture for 2015] [pdf]
  • J. Hauswald, M. A. Laurenzano, Y. Zhang, C. Li, A. Rovinski, A. Khurana, R.G. Dreslinski, T. Mudge, V. Petrucci, L. Tang, J. Mars. Designing Future Warehouse Scale Computers for Sirius, an Open End-to-End Voice and Vision Personal Assistant. ACM Trans. On Computer Systems, 34, 1, Article 2, April 2016, 32 pp. [pdf]
  • Y. Chen, S. Lu, H-S. Kim, D. Blaauw, R.G. Dreslinski, T. Mudge. A Low Power Software-Defined-Radio Baseband Processor for the Internet of Things. Proc. 22nd IEEE Int. Symp. on High Performance Computer Architecture (HPCA-22), March 2016, Barcelona, Spain, pp. 40-51. [pdf]

2015

  • Kloosterman, J. Beaumont, M. Wollman, A. Sethia, R.G. Dreslinski, T. Mudge, S. Mahlke. WarpPool: Sharing Requests with Inter-Warp Coalescing for Throughput Processors. The 48th Annual IEEE/ACM Int. Symp. on Microarchitecture (MICRO), Hawaii, December 2015, pp. 433-444.
  • H-M. Chen, A. Arunkumar, C-J. Wu, T. Mudge, C. Chakrabarti. E-ECC: Low Power Erasure and Error Correction Schemes for Increasing Reliability of Commodity DRAM Systems. MEMSYS 2015: International Symposium on Memory Systems, Washington DC, October 2015, pp. 60-70.
  • J. Hauswald, Y. Kang, M. Laurenzano, Q. Chen, C. Li, T. Mudge, R. G. Dreslinski, J. Mars, L. Tang. DjiNN and Tonic: DNN as a Service and Its Implications for Future Warehouse Scale Computers. Proc. 42nd Int. Symp. on Computer Architecture (ISCA), Portland, OR, June 2015, pp. 27-40.  [pdf]
  • T. Mudge. Thoughts on Winning the 2014 Eckert-Mauchly Award. IEEE MICRO. May 2015, pp. 144-146. [pdf]
  • T. Mudge. The Specialization Trend in Computer Hardware—A Perspective. Comm. ACM. Vol. 58, no. 4, April 2015, p. 84. [pdf]
  •  C. Gao, A. Gutierrez, M. Ranaj, R.G. Dreslinski, T. Mudge, C.-J. Wu. A Study of Mobile Device Utilization.  2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Philadelphia, March 2015, pp. 225-234. [pdf]
  • J. Ballast, M. Ahmed, R.G. Dreslinski, R. Higgins, D-I. Kang, T. Mudge, W. Snapp, E. Van Hensbergen, J. Walters, M. Wollman. The Next-Generation Space Processor (NGSP) Analysis Program: Benchmarking and ARM-Based Heterogeneous Multicore Processor Design for Space Applications. 40th Annual GOMACTech Conference, St. Louis March 2015.
  • J. Hauswald, M. A. Laurenzano, Y. Zhang, C. Li, A. Rovinski, A. Khurana, R.G. Dreslinski, T. Mudge, V. Petrucci, L. Tang, J. Mars.  Sirius: An Open End-to-End Voice and Vision Personal Assistant and Its Implications for Future Warehouse Scale Computers.  20th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Istanbul, March 2015, pp. 223-238. [pdf]
  • Q. Zheng, Y. Chen, H. Lee, R. Dreslinski, C. Chakrabarti, A. Anastasopoulos, S. Mahlke, and T. Mudge. Using Graphics Processing Units in an LTE Base Station. Journal of Signal Processing Systems, vol. 78, no. 1, pp. 35–47, January 2015. [pdf]

2014

  • T. Mudge (Author Retrospective). Improving data cache performance by pre-executing instructions under a cache miss. (James Dundas and Trevor Mudge) Proc. 1997 ACM Int. Conf. on Supercomputing, July 1997. Reprinted in the 25th Anniversary Issue of the International Conference on Supercomputing 1987-2011 (35 most influential papers), U. Banerjee Editor, 2014. pp. 40-41. [pdf]
  • S. Jeloka, R. Das, R.G. Dreslinski, T. Mudge, D. Blaauw. “Hi-Rise: A High-Radix Switch for 3D Integration with Single-Cycle Arbitration”. The 47th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO). [pdf]
  • J. Pusdesris, B. VanderSloot, T. Mudge. “A Memory Rename Table to Reduce Energy and Improve Performance”, 2014 International Symposium on Low Power Electronics Design (ISLPED), August 2014, pp. 279-282. [Poster and text] [pdf]
  • A. Gutierrez, R.G. Dreslinski, T. Mudge. “Evaluating Private vs. Shared Last-Level Caches for Energy Efficiency in Asymmetric Multi-Cores”, Fourteenth International Conference on Embedded Computer Systems: Architecture, Modeling and Simulation (SAMOS), July 2014, pp. 191-198. [pdf]
  • S. Rao, S. Jeloka, R. Das, D. Blaauw, R. Dreslinski, T. Mudge. “VIX: Virtual Input Crossbar for Efficient Switch Allocation”, Fifty first Design Automation Conference (DAC), June 2014, pp. 1-6. [pdf]
  • N. Abeyratne, S. Jeloka, Y. Kang, D. Blaauw, R.G. Dreslinski, R. Das, T. Mudge. “Quality-of-Service for a High-Radix Switch”, Fifty first Design Automation Conference (DAC), June 2014, pp. 1-6. [pdf]
  • J. Hauswald, T. Manville, Q. Zheng, R.G. Dreslinski, C. Chakrabarti, T. Mudge. “A Hybrid Approach to Offloading Mobile Image Classification”, 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2014, pp. 8375-8379. [pdf]
  • R.G. Dreslinski, Q. Zheng, R.P. Higgins, J. Hauswald, D. Blaauw, T. Mudge, C. Chakrabarti, J. Ballast, W. Snapp. “An Architecture for Low-Power High-Performance Embedded Computing”, Thirty ninth Annual GOMACTech Conference (GOMAC), April 2014, pp. 423-426. [pdf]
  • A. Gutierrez, J. Pusdesris, R. Dreslinski, T. Mudge, C. Sudanthi, C.D. Emmons, M. Hayenga, N. Paver. “Sources of Error in Full System Simulation”, 2014 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), March 2014, pp. 13-22. [Best Paper Award] [pdf]
  • A. Gutierrez, M. Cieslak, B. Giridhar, R. Dreslinski, L. Ceze, T. Mudge. “Integrated 3D-Stacked Server Designs for Increasing Physical Density of Key-Value Stores”, Nineteenth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), March 2014, pp. 485-498. [pdf]
  • C. Gao, A. Gutierrez, R. G. Dreslinski, T. Mudge, K. Flautner, G. Blake. “A Study of Thread Level Parallelism on Mobile Devices”, 2014 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), March 2014, pp. 126-127. [Poster and text] [pdf]

2013

  • B. Giridhar, M. Cieslak, D. Duggal, R. Dreslinski, H. Chen, R. Patti, B. Hold, C. Chakrabarti T. Mudge, D. Blaauw. “Exploring DRAM Organizations for Energy-Efficient and Resilient Exascale Memories”, Proc. of SC 13, Denver, Co, November 2013, 12pp. [pdf]
  • R. G. Dreslinski, D. Fick, B. Giridhar, G. Kim, S. Seo, M. Fojtik, S. Satpathy, Y. Lee, D. Kim, N. Liu, M. Wieckowski, G. Chen, D. Sylvester, D. Blaauw, T. Mudge. “Centip3De: A Many-Core Prototype Exploring 3D Integration and Near-Threshold Computing”, Communications of the ACM (CACM), vol 56, no. 11, November 2013, pp.97-104. [pdf]
  • Q. Zheng, Y. Chen, R. Dreslinski, C. Chakrabarti, A. Anastasopoulos, S. Mahlke, T. Mudge. “Architecting an LTE Base Station with Graphics Processing Units“, 2013 IEEE Workshop on Signal Processing Systems (SiPS 2013). October 2013, pp .219-224. [pdf]
  • N. Pinckney, R.G. Dreslinski, K. Sewell, D. Fick, T. Mudge, D. Sylvester, D. Blaauw. “Limits of Parallelism and Boosting in Dim Silicon“, IEEE Micro. Sept.Oct 2013, pp. 30-37. [pdf]
  • Q. Zheng, Y. Chen, R. Dreslinski, C. Chakrabarti, A. Anastasopoulos, S. Mahlke, T. Mudge. “WiBench: An Open Source Kernal Suite for Benchmarking Wireless Systems”, Proc. of the IEEE Int. Symposium on Workload Characterization (IISWC), September 2013, pp.123-132. [pdf]
  • R. Dreslinski, B. Girdihar, M. Cieslak, Y. Kang, D. Blaauw, T. Mudge, J. Vetter, R. Schreiber. “An Evaluation Methodology for Exascale Memory Systems”, Workshop on Modeling & Simulation of Exascale Systems & Applications (MODSIM), Seattle, Wa., September 2013, [pdf]
  • Q. Zheng, Y. Chen, R. Dreslinski, C. Chakrabarti, A. Anastasopoulos, S. Mahlke, T. Mudge. “Parallelization Techniques for Implementing Trellis Algorithms on Graphics Processors”, IEEE International Symposium on Circuits and Systems (ISCAS 2013), May 2013, pp. 1220-1223. [pdf]
  • R. Dreslinski, D. Fick, B. Giridhar, G. Kim, S. SEo, M. Fojtik, S. Satpathy, Y. Lee, D. Kim, N. Liu, M. Wieckowski, G. Chen, D. Sylvester, D. Blaauw, T. Mudge. “Centip3De: A 64-Core, 3D Stacked Near-Threshold System”, Micro, IEEE, vol. 33, no.2, pp.8,16, March-April 2013. [pdf]
  • N. Abeyratne, R. Das, Q. Li, K. Sewell, B. Giridhar, R. Dreslinski, D. Blaauw, T. Mudge. “Scaling Toward Kilo-Core Processors with Asymmetric High-Radix Topologies”, 19th IEEE International Symposium on High Performance Computer Architecture (HPCA 2013), February 2013, pp. 496-507. [pdf]
  • D. Fick, R. Dreslinski, B. Giridhar, G. Kim, S. Seo, M. Fojtik, S. Satpathy, Y. Lee, D. Kim, N. Liu, M. Wieckowski, G. Chen, T. Mudge, D. Blaauw, D. Sylvester. “Centip3De: A Cluster-Based NTC Architecture With 64 ARM Cortex-M3 Cores in 3D Stacked 130 nm CMOS”, IEEE Journal of Solid-State Circuits, Vol. 48, No. 1, January 2013, pp. 104-117. [pdf]

2012

  • A, Sethia, S. Mahlke, G. Dasika, T. Mudge. “A Customized Processor for Energy Efficient Scientific Computing”, IEEE Transactions on Computers, Vol. 61, No.12, December 2012, pp. 1711-1723. [pdf]
  • A. Gutierrez, J. Pusdesris, R. Dreslinski, T. Mudge. “Lazy Cache Invalidation for Self-Modifying Codes”, The International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES 2012), October 2012, pp. 151-160. [pdf]
  • R. Dreslinski, T. Manville, K. Sewell, R. Das, N. Pinckney, S. Satpathy, D. Blaauw, D. Sylvester, T. Mudge. “XPoint Cache: Scaling Existing Bus Based Coherence Protocols for 2D and 3D Many-Core Systems”, The International Conference on Parallel Architectures and Compilation Techniques (PACT 2012), September 2012, pp. 75-85. [pdf]
  • G. Chen, T. Mudge, D. Sylvester, D. Blaauw. “Centip3De: A 64-Core, 3D Stacked, Near-Threshold System”, Hotchips, August 2012, pp.380-403.[pdf]
  • R. Dreslinski, K. Sewell, S. Satpathy, T. Manville, N. Pinckney, G. Blake, M. Cieslak, R. Das, T. Wenisch, D. Sylvester, D. Blaauw, T. Mudge. “Swizzle Switch: A Self-Arbitrating High-Radix Crossbar for NoC Systems”, Hotchips, August 2012, pp.380 – 403.[pdf]
  • R. Dreslinski, B. Giridhar, N. Pinckney, D. Blaauw, D. Sylvester, T. Mudge. “Reevaluating Fast Dual-Voltage Power Rail Switching Circuitry”, 2012 Workshop on Duplicating, Decontructing and Debunking (WDDD), June 2012, pp. 1-7.[pdf]
  • P. Tandon, J. Chang, R. Dreslinski, P. Ranganathan, T. Mudge, T.F. Wenisch. “PicoServer Revisited: On the Profitability of Eliminating Intermediate Cache Levels”, 2012 Workshop Duplicating, Decontructing and Debunking (WDDD), June 2012, pp. 1-10.[pdf]
  • N. Pinckney, R. Dreslinski, K. Sewell, D. Fick, D. Blaauw, D. Sylvester, T. Mudge. “Limits of Voltage-Scaled Parallel Architectures to Combat Dark Silicon”, 2012 Workshop on Dark Silicon (DaSi), June 2012. [pdf]
  • K. Sewell, R. Dreslinski, T. Manville, S. Satpathy, N. Pinckney, G. Blake, M. Cieslak, R. Das, T. Wenisch, D. Sylvester, D. Blaauw, T. Mudge. “Swizzle-Switch Networks for Many-Core Systems”. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, Vol. 2, No. 2, June 2012, pp. 278-294. [pdf]
  • S. Satpathy, R. Das, R. Dreslinski, T. Mudge, D. Sylvester, D. Blaauw. “High Radix Self-Arbitrating Switch Fabric with Multiple Arbitration Schemes and Quality of Service”. Proc. of the ACM/IEEE Design Automation Conference (DAC), San Francisco, CA, June 2012, pp. 406-411. [pdf]
  • S. Seo, R. Dreslinski, M. Woh, Y. Park, C. Charkrabari, S. Mahlke, D. Blaauw, T. Mudge. “Process Variation in Near-Threshold Wide SIMD Architectures”. Proc. of the ACM/IEEE Design Automation Conference (DAC), San Francisco, CA, June 2012, pp. 980-987. [pdf]
  • N. Pinckney, K. Sewell, R. Dreslinski, D. Fick, T. Mudge, D. Sylvester, and D. Blaauw. “Assessing the Performance Limits of Parallelized Near-Threshold Computing”. Proc. of the ACM/IEEE Design Automation Conference (DAC), San Francisco, CA, June 2012, pp. 1143-1148. [pdf]
  • J. Chang, K. Lim, T. Mudge, P. Ranganathan, D. Roberts, M. Shah. “A Limits Study of Benefits from Nanostore-Based Future Data-Centric System Architectures”. ACM Internation Conference on Computing Frontiers (CF’ 12), May 2012, pp. 33-42. [pdf]
  • D. Fick, R. Dreslinski, B. Giridhar, G. Kim, S. Seo, M. Fojtik, S. Satpathy, Y. Lee, D. Kim, N. Liu, M. Wieckowski, G. Chen, T. Mudge, D. Sylvester, D. Blaauw. “Centip3De: A 3930 DMIPS/W Configurable Near‐Threshold 3D Stacked System With 64 ARM Cortex‐M3 Cores”, IEEE International Solid-State Circuits Conference (ISSCC). San Francisco, CA, February 2012, pp. 190-191. [pdf]
  • S. Satpathy, K. Sewell, T. Manville, Yen-Po Chen, R. Dreslinski, D. Sylvester, T. Mudge, D. Blaauw. “A 4.5Tb/s 3.4Tb/s/W 64×64 switch fabric with self-updating least recently granted priority and quality of service arbitration in 45nm CMOS”, IEEE International Solid-State Circuits Conference (ISSCC). San Francisco, CA, February 2012, pp. 478-479. [pdf]

2011

  • A. Gutierrez, R. Dreslinski, A. Saidi, C. Emmons, N. Paver. T. Wenisch, T. Mudge. “Full-System Analysis and Characterization of Interactive Smartphone Applications”, IEEE International Symposium on Workload Characterization (IISWC-2011), Austin, TX, USA, November 6-8, 2011. pp. 81-90. [pdf]
  • G. Dasika, A. Sethia, T. Mudge, S. Mahlke. “PEPSC: A Power-Efficient Processor for Scientific Computing”, 20th Int. Conf. on Parallel Architectures and Compilation Techniques (PACT) Galveston Island, Texas, October 2011, pp. 101-110. [pdf]
  • C. Yang, Y. Emre, C. Chakrabarti, and T.Mudge, “Flexible product code-based ECC schemes for MLC Nand FLASH memories”, IEEE Workshop on Signal Processing Systems, (SiPS 2011), October 4-7 2011, Beirut, Lebanon. [pdf]
  • S. Satpathy, R. Dreslinski, T. Ou, D. Sylvester, T. Mudge, D. Blaauw. “SWIFT: A 2.1Tb/s 32×32 Self-Arbitrating Manycore Interconnect Fabric”, Symposia on VLSI Technology and Circuits. Kyoto, Japan, June 2011, pp. 138-139. [Winner in the 11th Annual International VLSI Symposium Low Power Contest] [pdf]
  • D. Fick, R. Dreslinski, B. Giridhar, G. Kim, S. Seo, M. Fojtik, S. Satpathy, Y. Lee, D. Kim, N. Liu, M. Wiekowski, G. Chen, T. Mudge, D. Sylvester, and D. Blaauw. “Design and Implementation of Centip3De, a 7-layer Many-Core System”, Proc. of the ACM/IEEE Design Automation Conference (DAC), San Francisco, CA, June 2011. [Winner DAC/ISSCC Student Design Contest]
  • M. Woh, S. Satpathy, R. Dreslinski, D. Kershaw, D. Sylvester, D. Blaauw, T. Mudge. “Low Power Interconnects for SIMD Computers”, Proceedings Design, Automation and Test in Europe DATE 11. March 2011. Grenoble, France. pp. 600-605. [pdf]
  • A. Hormati, M. Samadi, M. Woh, T. Mudge, S. Mahlke. “Sponge: Portable Stream Programming on Graphics Engines”, 16th International Conference on Architectural Support for Programming Languages and Operating Systems ASPLOS XVI. March 2011, Newport Beach, CA. pp. 381-392. [pdf]
  • G. Blake, R. Dreslinski, T. Mudge. “Bloom Filter Guided Transaction Scheduling”, Proceedings 37th International Symposium on High Performance Computer Architecture HPCA 17. February 2011. San Antonio, TX. pp. 75-86. [pdf]

2010

  • M. Woh, S. Seo, C. Chakrabarti, S. Mahlke, T. Mudge. “An Ultra Low Power SIMD Processor for Wireless Communications”, Proceedings of the Asilomar Conference on Signals, Systems and Computers. November, 2010. Asilomar, CA. [pdf]
  • G. Chen, D. Sylvester, D. Blaauw, T. Mudge. “Yield-driven Near-threshold SRAM Design”, IEEE Transactions on VLSI Systems. vol 18, no 11. November, 2010. pp. 1590-1598. [pdf]
  • G. Dasika, M. Woh, S. Seo, N. Clark, T. Mudge, S. Mahlke. “Mighty-Morphing Power-SIMD”, Proceedings 2010 International Conference on Compilers, Architecture, and Synthesis for Embedded Systems CASES 10. October, 2010. Scottsdale, AZ. pp. 67-75. [pdf]
  • G. Dasika, A. Sethia, V. Robby, T. Mudge, S. Mahlke. “MEDICS: Ultra-Portable Processing for Medical Image Reconstruction”, 19th International Conference on Parallel Architectures and Compilation Techniques PACT 10. September, 2010. Vienna, Austria. pp. 181-192. [pdf]
  • H. Lee, C. Chakrabarti, T. Mudge. “A Low Power DSP for Wireless Communications”, IEEE Transactions on VLSI Systems. vol 18, no 9. September, 2010. pp. 1310-1322. [pdf]
  • S. Seo, M. Woh, R. Dreslinski, C. Chakrabarti, S. Mahlke, T. Mudge. “Diet SODA: A Power-Efficient Processor for Digital Cameras”,International Symposium on Low Power Electronics and Design ISLPED. August, 2010. pp. 79-84. [pdf]
  • T. Mudge. “Challenges And Opportunities For Extremely Energy-efficient Processors”, IEEE MICRO. vol 30, no 4. July, 2010. pp. 20-22.[pdf]
  • G. Blake, R. Dreslinski, T. Mudge, K. Flautner. “Evolution of Thread-Level Parallelism in Desktop Applications”, The 37th International Symposium on Computer Architecture. June, 2010. St. Malo, France. pp. 302-313. [pdf]
  • M. Shah, P. Ranganathan, J. Chang, N. Tolia, D. Roberts, T. Mudge. “Data Dwarfs: Motivating a Coverage Set for Future Large Data Center Workloads”, Architectural Concerns in Large Datacenters Workshop held in conjunction with The 37th International Symposium on Computer Architecture. June, 2010. St. Malo, France. [Poster and text] [pdf]
  • S. Satpathy, Z. Foo, B. Giridhar, R. Dreslinski, D. Sylvester, T. Mudge, D. Blaauw. “A 1.07 Tbit/s 128×128 Swizzle Network for SIMD Processors”, IEEE Symposium on VLSI Circuits. June, 2010. Honolulu, Hawaii. pp. 81-82. [pdf]
  • M. Wieckowski, R. Dreslinski, T. Mudge, D. Blaauw, D. Sylvester. “Circuit design advances for ultra-low power sensing platforms”, Proceedings of SPIE. vol 7679, 76790W. April, 2010. pp. 1-7. DOI: 10.1117/12.850720. [pdf]
  • K. Lim, J. Chang, J. Santos, Y. Turner, T. Mudge, P. Ranganathan, S. Reinhardt, T. Wenisch. “Hypervisor-based Prototyping of Disaggregated Memory and Benefits of VM Consolidation”, 15th International Conference Architectural Support for Programming Languages and Operating Systems. March, 2010. Pittsburgh, Pennsylvania. [Poster] [pdf]
  • A. Hormati, Y. Choi, M. Woh, M. Kudlur, R. Rabbah, T. Mudge, S. Mahlke. “MacroSS: Macro-SIMDization of Streaming Applications”, 15th International Conference on Architectural Support for Programming Languages and Operating Systems. March, 2010. Pittsburgh, Pennsylvania. pp. 285-296. [pdf]
  • R. Dreslinski, M. Wieckowski, D. Blaauw, D. Sylvester, T. Mudge. “Near-threshold Computing: Reclaiming Moore’s Law Through Energy Efficient Integrated Circuits”, Proceedings of the IEEE. vol 98, no 2. February, 2010. pp. 253-256. [pdf]
  • M. Woh, S. Seo, S. Mahlke, T. Mudge, C. Chakrabarti, K. Flautner. “AnySP: Anytime Anywhere Anyway Signal Processing”, IEEE MICRO Top Picks Issue. vol 30, no 1. January, 2010. pp. 81-91. [Top Pick: selected as one of the 12 best papers in computer architecture for 2009] [pdf]
  • T. Mudge. “Guest Editor’s Introduction: Top Picks From The Computer Architecture Conferences Of 2009”, IEEE MICRO. vol 30, no 1. January, 2010. pp. 8-11. [pdf]
  • M. Woh, S. Mahlke, T. Mudge, C. Chakrabarti. “Mobile Supercomputers for the Next-Generation Cell Phone”, Computer. vol 43, no 1. January, 2010. pp. 93-97. [pdf]
  • T. Kgil, D. Roberts, T. Mudge. (Eds.) Y. Xie, J. Cong, S. Sapatnekar. “PicoServer: Using 3D Stacking Technology To Build Energy Efficient Servers”, Kluwer Hardcover. 2010. pp. 219-260. ISBN: 978-1-4419-0783-7. [pdf]

2009

  • G. Dasika, A. Sethia, T. Mudge, S. Mahlke. “Low Power Scientific Computing”, 2009 Workshop on New Directions in Computer Architecture held in conjunction with The 42nd Annual IEEE/ACM International Symposium on Microarchitecture. December, 2009. New York, New York. pp. 7-8. [pdf]
  • G. Blake, R. Dreslinski, T. Mudge. “Proactive Transaction Scheduling for Contention Management”. The 42nd Annual IEEE/ACM International Symposium on Microarchitecture. December, 2009. New York, New York. pp. 156-167. [pdf]
  • R. Dreslinski, M. Wieckowski, D. Blaauw, D. Sylvester, T. Mudge. “Overcoming Moore’s Curse: Techniques for Powering Large Transistor Counts in Sub-Micron Technologies”, 2009 Workshop on New Directions in Computer Architecture held in conjunction with The 42nd Annual IEEE/ACM International Symposium on Microarchitecture. December, 2009. New York, New York. pp. 20-21. [pdf]
  • Y. Lin, M. Woh, S. Seo, C. Chakrabarti, S. Mahlke, T. Mudge. “Baseband Processing Architectures for SDR”, CRC Press. Chapter 21 in Wireless, Networking, Radar, Sensor Array Processing and Nonlinear Signal Processing, The Digital Signal Processing Handbook. November, 2009. pp. 21-1 — 21-18. [pdf]
  • G. Blake, R. Dreslinski, T. Mudge. “A Survey of Multicore Processors”, IEEE Signal Processing Magazine. vol 26, no 6. November, 2009. pp. 26-37. [pdf]
  • A. Hormati, Y. Choi, M. Kudlur, R. Rabbah, T. Mudge, S. Mahlke. “Flextream: Adaptive Compilation of Streaming Applications for Heterogeneous Architectures”, Parallel Architectures and Compilation Techniques. September, 2009. pp. 214-223. [pdf]
  • M. Woh, Y. Lin, S. Seo, S. Mahlke, T. Mudge. “Analyzing the Next Generation Software Defined Radio for Architectures”, Springer Journal of Signal Processing Systems. August, 2009. New York. DOI: 10.1007/s11265-009-0402-z. [published online] [pdf]
  • S. Seo, M. Woh, S. Mahlke, T. Mudge, S. Vijay, C. Chakrabarti. “Customizing Wide-SIMD Architectures for H.264”, International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation. July, 2009. Greece. pp. 172-179. [pdf]
  • R. Dreslinski, D. Fick, D. Blaauw, D. Sylvester, T. Mudge. “Reconfigurable Multicore Server Processors for Low Power Operation”, International Workshop on Systems, Architectures, Modeling, and Simulation. July, 2009. Greece. pp. 247-254. [pdf]
  • K. Lim, J. Chang, T. Mudge, P. Ranganathan, S. Reinhardt, T. Wenisch. “Disaggregated Memory for Expansion and Sharing in Blade Servers”, The 36th International Symposium on Computer Architecture. June, 2009. Austin, Texas. pp. 267-278. [pdf]
  • A. Saidi, N. Binkert, S. Reinhardt, T. Mudge. “End-To-End Performance Forecasting: Finding Bottlenecks Before They Happen”, The 36th International Symposium on Computer Architecture. June, 2009. Austin, Texas. pp. 361-370. [pdf]
  • M. Woh, S. Seo, S. Mahlke, T. Mudge, C. Chakrabarti, K. Flautner. “AnySP: Anytime Anywhere Anyway Signal Processing”, The 36th International Symposium on Computer Architecture. June, 2009. Austin, Texas. pp. 128-139. [pdf]
  • R. Dreslinski, M. Wieckowski, D. Blaauw, D. Sylvester, T. Mudge. “Near Threshold Computing: Overcoming Performance Degradation from Aggressive Voltage Scaling”, Workshop on Energy-Efficient Design held in conjunction with The 36th International Symposium on Computer Architecture WEED. June, 2009. Austin, Texas. pp. 44-49. [pdf]
  • Ringenberg J., T. Mudge. “SuiteSpecks and SuiteSpots: A Methodology for the Automatic Conversion of Benchmarking Programs into Intrinsically Checkpointed Assembly Code”, 2009 IEEE International Symposium on Performance Analysis of Systems and Software. April, 2009. Boston, Massachusetts. pp. 227-237. [pdf]
  • D. Roberts, T. Kgil and T. Mudge, “Integrating NAND Flash Devices Onto Servers”, Communications of the ACM, Research Highlights. vol 52, no 4. April, 2009. pp. 98-103. [pdf]
  • D. Roberts, T. Kgil, T. Mudge. “Using Non-Volatile Memory to Save Energy in Servers”. Design, Automation and Test in Europe. April, 2009. Nice, France. pp. 743-748. [pdf]
  • Y. Choi, Y. Lin, N. Chong, S. Mahlke, T. Mudge. “Stream Compilation for Real-time Embedded Multicore Systems”, The 2009 International Symposium on Code Generation and Optimization. March, 2009. pp. 210-220. [pdf]
  • K. Sewell, T. Mudge, S. Reinhardt. “EXtreme Virtual Pipelining (XVP): Moving Towards Scalable Multithreaded Processors”, Wild and Crazy Ideas held in conjunction with 16th International Conference on Architectural Support for Programming Languages and Operating Systems. March, 2009. Washington DC. [pdf]
  • K. Lim, P. Ranganathan, J. Chang, C. Patel, T. Mudge, S. Reinhardt. “Server Designs for Warehouse Computing Environments”, IEEE MICRO. vol 29, no 1. January, 2009. pp. 41-49. [pdf]

2008

  • M. Woh, Y. Lin, S. Seo, S. Mahlke, T. Mudge, C. Chakrabarti, R. Bruce, D. Kershaw, A. Reid, M. Wilder, K. Flautner. “From SODA to Scotch: The Evolution of a Wireless Baseband Processor”, November, 2008. Lake Como. pp. 152-163. [Best Paper] [pdf]
  • R. Dreslinski, G. Chen, T. Mudge, D. Blaauw, D. Sylvester, K. Flautner. “Reconfigurable Energy Efficient Near Threshold Cache Architectures”, The 41st IEEE/ACM International Symposium on Microarchitecture. November, 2008. Lake Como, Italy. pp. 459-470. [pdf]
  • T. Kgil, A. Saidi, N. Binkert, S. Reinhardt, K. Flautner, T. Mudge. “PicoServer: Using 3D Stacking Technology To Build Energy Efficient Servers”, ACM Journal on Emerging Technologies in Computing Systems. vol 4, no 4. October, 2008. 33 pp. [pdf]
  • Y. Lin, Y. Choi, S. Mahke, T. Mudge, C. Chakrabarti. “A parameterized dataflow language extension for embedded streaming systems”, International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation. July, 2008. Samos, Greece. pp. 10-17. [pdf]
  • E. Ozer, R. Dreslinski, T. Mudge, S. Biles, K. Flautner. (Eds.) M. Berekovic, N. Dimopoulos, S. Wong. “Energy-efficient simultaneous thread fetch from different cache levels in a soft real-time SMT processor”, Springer-Verlag Berlin Heidelberg. SAMOS VIII Workshop. July, 2008. Greece. pp. 12-22. [pdf]
  • T. Kgil, D. Roberts, T. Mudge. “Improving NAND Flash based disk caches”, The 35th International Symposium on Computer Architecture. June, 2008. Beijing, China. pp. 327-338. [pdf]
  • K. Lim, P. Ranganathan, J. Chang, C. Patel, T. Mudge, S. Reinhardt. “Understanding and designing new server architectures for emerging warehouse-computing environments”, The 35th International Symposium on Computer Architecture. June, 2008. Beijing, China. pp. 315-326. [Top Pick: selected as one of the 12 best papers in computer architecture for 2008] [pdf]
  • A. Saidi, N. Binkert, T. N. Mudge, S. K. Reinhardt. “Full System Critical Path Analysis”, 2008 IEEE International Symposium on Performance Analysis of Systems and Software. April, 2008. pp. 63-74. [pdf]
  • D. Roberts, N. S. Kim, T. Mudge. “On-chip cache device scaling limits and effective fault repair techniques in future nanoscale technology”, Journal on Microprocessors and Microsystems. vol 32. April 2008. pp. 244-253. [pdf]
  • M. Woh, Y. Lin, S. Seo, T. Mudge, S. Mahlke. “Analyzing the scalability of SIMD for the next generation software defined radio”, 33rd International Conference on Acoustics, Speech, and Signal Processing. April, 2008. Las Vegas, Nevada. pp. 5388-5391. [pdf]
  • D. B. C. Tokunaga, T. Mudge. “True Random Number Generator with a Metastability-based Quality Control”, IEEE Journal on Solid-State Circuits. vol 43, no 1. January, 2008. pp. 78-85. [pdf]
  • S. Das, D. Roberts, D. Blaauw, D. Bull, T. Mudge. (Eds.) A. Wang, S. Naffziger. “Architectural Techniques For Adaptive Computing”, Springer Science+Business Media. Adaptive Techniques for Dynamic Processor Optimization. 2008. pp. 175-204. [pdf]

2007

  • G. Chen, D. Blaauw, T. Mudge, D. Sylvester, N. Kim. “Yield-Driven Near-Threshold SRAM Design”, IEEE/ACM International Conference on Computer Aided Design. November, 2007. pp. 660-666. [pdf]
  • Y. Lin, M. Kudlur, S. Mahlke, T. Mudge. “Hierarchical Coarse-grained Stream Compilation for Software Defined Radio”, International Conference on Compiler and Architecture Support for Embedded Systems. October, 2007. Salzburg, Austria. pp. 115-124. [pdf]
  • S. Seo, T. Mudge, Y. Zhu, C. Chakrabarti. “Design and Analysis of LDPC Decoders for Software Defined Radio”, 2007 IEEE Workshop on Signal Processing Systems. October, 2007. Shanghai, China. pp. 210-215. [pdf]
  • R. Dreslinski, B. Zhai, T. Mudge, D. Blaauw, D. Sylvester. “An Energy Efficient Parallel Architecture Using Near Threshold Operation”, 16th International Conference on Parallel Architectures and Compilation Techniques. September, 2007. Romania. pp. 175-188. [pdf]
  • D. Roberts, R. Dreslinski, E. Karl, T. Mudge, D. Sylvester, D. Blaauw. “When Homogeneous becomes Heterogeneous: Wearout Aware Task Scheduling for Streaming Applications”, Workshop on Operating System Support for Heterogeneous Multicore Architectures. September, 2007. Romania. pp. 5-13. [pdf]
  • B. Zhai, R. Dreslinski, D. Blaauw, T. Mudge, D. Sylvester. “Energy Efficient Near-threshold Chip Multi-processing”, International Symposium on Low Power Electronics and Design – 2007. August, 2007. pp. 32-37. [Best Paper Nomination] [pdf]
  • D. Roberts, N. S. Kim, T. Mudge. “On-chip cache device scaling limits and effective fault repair techniques in future nanoscale technology”, 10th EUROMICRO Conference on Digital System Design Architectures, Methods and Tools. August, 2007. Lubeck, Germany. pp. 570-578. [pdf]
  • M. Woh, S. Seo, H. Lee, Y. Lin, S. Mahlke, T. Mudge, C. Chakrabarti, K. Flautner. (Eds.) S. Vassiliadis et al. Springer-Verlag Berlin Heidelberg. “The Next Generation Challenge for Software Defined Radio”, SAMOS VII Workshop. July, 2007. Greece. pp. 343-354. [Best Paper] [pdf]
  • G. Blake, T. Mudge. “Duplicating and Verifying LogTM with OS Support in the M5 Simulator”, Workshop on Duplicating, Deconstructing, and Debunking held in conjunction with the International 34th Symposium on Computer Architecture. June, 2007. San Diego, California. pp. 23-31. [pdf]
  • R. Dreslinski, A. Saidi, T. Mudge, S. Reinhardt. “Analysis of hardware prefetching across virtual page boundaries”, ACM International Conference on Computing Frontiers. May, 2007. Italy. pp. 13-22. [pdf]
  • C. Tokunaga, D. Blaauw, T. Mudge. “True random number generator with a metastable-based quality control”, International Solid-State Circuits Conference. February, 2007. pp. 404-405. [pdf]
  • Y. Lin, H. Lee, M. Woh, Y. Harel, S. Mahlke, T. Mudge, C. Chakrabarti, K. Flautner. “SODA: A High-Performance DSP Architecture for Software-Defined Radio”, IEEE MICRO Top Picks Issue. January, 2007. pp. 114-123. [pdf]

2006

  • Y. Lin, R. Mullenix, M. Woh, S. Mahlke, T. Mudge, A. Reid, K. Flautner. “SPEX: A programming language for software defined radio”, 2006 SDR Technical Conference. November, 2006. Orlando, Florida. Section 2.3, 6 pp. [pdf]
  • Kgil T., D’Souza, S., A. Saidi, N. Binkert, R. Dreslinski, S. Reinhardt, K. Flautner, T. Mudge. “PicoServer: Using 3D stacking technology to enable a compact energy efficient chip multiprocessor”, 12th International Conference on Architectural Support for Programming Languages and Operating Systems. November, 2006. pp. 117-128. [pdf]
  • T. Kgil, T. Mudge. “FlashCache: A NAND Flash memory file cache for low power web servers”, Conference for Compiler and Architecture Support for Embedded Systems. October, 2006. Seoul, South Korea. pp. 103-112. [pdf]
  • H. Lee, C. Chakrabati, T. Mudge. “Reducing idle mode power in software defined radio terminals”, International Symposium on Low Power Electronics and Design – 2006. October, 2006. Tegernsee, Germany. pp. 101-106. [pdf]
  • Y. Lin, S. Mahlke, T. Mudge, C. Chakrabarti, A. Reid, K. Flautner. “Design and implementation of Turbo decoders for software defined radio”, IEEE 2006 Workshop on Signal Processing Systems. October, 2006. Banff, Canada. pp. 22-27. [pdf]
  • E. Karl, D. Sylvester, D. Blaauw, T. Mudge. “Reliability modeling and management in dynamic microprocessor-based systems”, The ACM/IEEE Design Automation Conference. June, 2006. San Francisco, California. pp. 1057-1060. [pdf]
  • S. Plaza, I. Kountanis, Z. Andraus, V. Bertacco, T. Mudge. “Advances and insights into parallel SAT solving”, International Workshop on Logic and Synthesis. June, 2006. pp. 188-194. [pdf]
  • Y. Lin, H. Lee, M. Woh, Y. Harel, S. Mahlke, T. Mudge, C. Chakrabarti, K. Flautner. “SODA: A low-power architecture for software radio”, 33rd Annual International Symposium on Computer Architecture. June, 2006. Boston, Massachusetts, USA. pp. 89-101. [Top Pick: selected as one of the 12 best papers in computer architecture for 2006] [pdf]
  • S. Das, D. Roberts, S. Lee, S. Pant, D. Blaauw, T. Austin, T. Mudge, K. Flautner. “A self-tuning dynamic voltage scaled processor using delay-error detection and correction”, IEEE International Conference on Integrated Circuit Design and Technology. May, 2006. Padova, Italy. pp. 211-214. [pdf]
  • A. Jerraya, T. Mudge. “Guest editorial: Concurrent hardware and software design for multiprocessor SoC”, ACM Transactions on Embedded Computing Systems. vol 5, no 2. May, 2006. pp. 259-262. [pdf]
  • S. Das, D. Roberts, S. Lee, S. Pant, D. Blaauw, T. Austin, K. Flautner, T. Mudge. “A self-tuning DVS processor using delay-error detection and correction”, IEEE Journal on Solid-State Circuits. vol 41, no 4. April, 2006. pp. 792-804. [pdf]

2005

  • H. Lee, Y. Lin, Y. Harel, M. Woh, S. Mahlke, T. Mudge, K. Flautner. “Software defined radio-A high performance embedded challenge”, 1st International Conference on High Performance Embedded Architectures and Compilers. November, 2005. Barcelona, Spain. [pdf]
  • D. Oehmke, N. Binkert, T. Mudge, S. Reinhardt. “How to fake 1000 registers”, 38th Annual IEEE/ACM Symposium Microarchitecture. November, 2005. pp. 7-18. [Best Paper Nomination] [pdf]
  • Y. Lin, H. Lee, Y. Harel, M. Woh, N. Baron, S. Mahlke, T. Mudge, K. Flautner. “A system solution for high-performance, low-power SDR”, Proc. 2005 SDR Technical Conf. November, 2005. Annaheim, California. [pdf]
  • N. S. Kim, T. Kgil, K. Bowman, V. De, T. Mudge. “Total power-optimal pipelining and parallel processing under process variations in nanometer technology”, International Conference of Computer Aided Design ICCAD. November, 2005. pp. 535-540. [pdf]
  • N.S. Kim, D. Blaauw and T. Mudge, “Quantitative analysis and optimization techniques for on-chip cache leakage power”, IEEE Transactions on VLSI. vol 13, no 10. October, 2005. pp. 1147-1156. [pdf]
  • H. Lee, T. Mudge. “A dual processor solution for the MAC layer of a software defined radio terminal”, Conference on Compiler and Architecture Support for Embedded Systems. September, 2005. California. pp. 257-265. [pdf]
  • T. Mudge.“Introduction to the special issue of the IEEE Transactions in Computers on Energy Efficient Computing”, IEEE Transactions in Computers on Energy Efficient Computing. vol 54, no 6. June, 2005. pp. 641-641. [pdf]
  • W. Shi, H. H. Lee, G. Gu, M. Ghosh, L. Falk, T. Mudge. “An Intrusion tolerant and self-recoverable network service system using security enhanced chip multiprocessors”, The 2nd International Conference on Autonomic Computing. June, 2005. Seattle, Washington. [pdf]
  • S. Das, S. Pant, D. Roberts, S. Lee, D. Blaauw, T. Austin, T. Mudge, K. Flautner. “A self-tuning DVS processor using delay-error detection and correction”, 2005 Symposium on VLSI Circuits. June, 2005. Kyoto, Japan. pp. 258-261. [pdf]
  • A. Cheng, G. Tyson, T. Mudge. “PowerFITS: Reduce Dynamic and Static I-cache power using application specific instruction set synthesis”, The IEEE International Symposium on Performance Analysis of Systems and Software. March, 2005. Austin, Texas. pp. 32-41. [pdf]
  • R. Bai, N. S. Kim, T. Mudge, D. Sylvester. “Power-performance trade-offs in nanometer-scale multi-level caches considering total leakage”, Design, Automation and Test in Europe. March, 2005. Munich, Germany. pp. 650-651. [pdf]
  • D. Roberts, T. Austin, D. Blaauw, T. Mudge. “Error analysis for the support of robust voltage scaling”,6th International Symposium on Quality Electronic Design. March, 2005. pp. 65-70. [pdf]
  • J. Ringenberg, C. Pelosi, D. Oehmke, T. Mudge. “Intrinsic checkpointing: A methodology for decreasing simulation time through binary modification”, IEEE International Symposium on Performance Analysis of Systems and Software. March, 2005. Austin, Texas. pp. 78-88. [pdf]
  • R. Bai, N. S. Kim, D. Sylvester, T. Mudge. “Total leakage optimization strategies for multi-level caches”, ACM/IEEE Great Lakes Symposium on VLSI. 2005. pp. 381-384. [pdf]
  • H. Kaul, D. Sylvester, D. Blaauw, T. Mudge, T. Austin. “DVS for on-chip bus designs based on timing error correction”, Design, Automation and Test in Europe. 2005. Munich, Germany. pp. 80-85. [pdf]
  • T. Austin, V. Bertacco, D. Blaauw, T. Mudge. “Opportunities and challenges for better than worst-case design”, Asia South Pacific Design Automation Conference. vol 1. January, 2005. China. pp. I/2-I/7. [pdf]

2004

  • D. Ernst, S. Das, S. Lee, D. Blaauw, T. Austin, T. Mudge, N. S. Kim, K. Flautner. “Razor: circuit-level correction of timing errors for low-power operation”, IEEE MICRO. vol 24, no 6. November, 2004. pp. 10-20. [Best Paper, Top Pick: selected as one of the best papers in computer architecture for 2006 from the top conferences of 2003/4: Micro-36, HPCA 10, ISCA 31, PACT 2004, ASPLOS XI] [pdf]
  • T. Kgil, L. Falk, T. Mudge. “ChipLock: Support for Secure Microarchitectures”, Workshop on Architectural Support for Security and Anti-virus held in conjunction with the 11th International Conferference on Architectural Support for Programming Languages and Operating Systems. October, 2004. Boston, Massachusetts. pp. 130-139. [pdf]
  • Y. Lin, N. Baron, H. Lee, S. Mahlke, T. Mudge. “A programmable vector coprocessor architecture for wireless applications”, 3rd Workshop on Application Specific Processors held in conjunction with the International Conference on Hardware/Software Codesign and System Synthesis. September, 2004. Stockholm. pp. 103-110. [pdf]
  • N. Kim, T. Kgil, V. Bertacco, T. Austin, T. Mudge. “Microarchitectural power modeling techniques for deep sub-micron microprocessors”, The International Symposium on Low Power Electronics and Design. August, 2004. Newport Beach, California. pp. 212-217. [pdf]
  • N. Kim, K. Flautner, D. Blaauw, T. Mudge.“Single Vdd and single Vt super-drowsy techniques for low-leakage high-performance instruction caches”, The International Symposium on Low Power Electronics and Design. August, 2004. Newport Beach, California. pp. 54-57. [pdf]
  • S. Lee, T. Austin, D. Blaauw, T. Mudge. “Reducing pipeline energy demands with local DVS and dynamic retiming”, The International Symposium on Low Power Electronics and Design. August, 2004. Newport Beach, California. pp. 319-324. [pdf]
  • A. Cheng, G. Tyson, T. Mudge. “FITS: Framework-based instruction-set tuning synthesis for embedded application specific processors”, The ACM/IEEE Design Automation Conference. June, 2004. San Diego, California. pp. 920-923. [pdf]
  • S. Lee, S. Das, V. Bertacco, T. Austin, D. Blaauw, T. Mudge. “Circuit-aware architectural simulation”, The ACM/IEEE Design Automation Conference. June, 2004. San Diego, California. pp. 305-310. [pdf]
  • T. Austin, D. Blaauw, S. Mahlke, T. Mudge, C. Chakrabati, W. Wolf. “Mobile Supercomputers”, Computer. vol 37, no 5. May, 2004. pp. 81-83. [pdf]
  • A. Cheng, G. Tyson, T. Mudge. “FITS: Increasing code density for embedded systems with a cost-effective 16-bit ISA synthesis technique” 2nd IEEE/ACM Workshop on Optimizations for DSP and Embedded Systems held in conjunction with the International Symposium on Code Generation and Optimization”, March, 2004. San Jose, California.
  • N. Kim, K. Flautner, D. Blaauw, T. Mudge. “Circuit and microarchitectural techniques for reducing cache leakage power”, IEEE Transactions on VLSI. vol 12, no 2. February, 2004. pp. 167-184. [pdf]
  • T. Austin, D. Blaauw, T. Mudge, K. Flautner. “Making Typical Silicon Matter with Razor”, Computer. vol 37, no 3. 2004. pp. 57-65. [pdf]

2003

  • D. Ernst, N. Kim, S. Das, S. Pant, T. Pham, R. Rao, C. Ziesler, D. Blaauw, T. Austin, T. Mudge, K. Flautner. “Razor: A low-power pipeline based on circuit-level timing speculation”, The 36th Annual IEEE/ACM Symposium on Microarchitecture. December, 2003. pp. 7-18. [Best paper] [pdf]
  • J. Ringenberg, D. Oehmke, T. Austin, T. Mudge. “SimpleDSP: A Fast and Flexible DSP Processor Model”, 5th Workshop on Media and Streaming Processors held in conjunction with the 36th Annual IEEE/ACM Symposium on Microarchitecture. December, 2003. [pdf]
  • N. Kim, T. Austin, D. Blaauw, T. Mudge, K. Flautner, J. Hu, M. Irwin, M. Kandemir, N. Vijaykrishnan. “Leakage Current: Moore’s Law Meets Static Power”, Computer. vol 36, no 12. December, 2003. pp. 68-75. [pdf]
  • N. Kim, D. Blaauw, T. Mudge. “Leakage power optimization techniques for ultra deep sub-micron multi-level caches”, International Conference of Computer Aided Design. November, 2003. San Jose, California. pp. 627-632. [pdf]
  • N. Kim, T. Mudge, R. Brown. “A 2.3Gb/s fully integrated and synthesizable AES Rijndael core” IEEE Custom Integrated Circuits Conference. September, 2003. pp. 193-196. [pdf]
  • N. Kim, T. Mudge. “Microarchitecture for a low power register file with reduced register ports”, The International Symposium on Low Power Electronics and Design. August, 2003. Seoul, South Korea. pp. 384-389. [pdf]
  • N. Kim, T. Mudge. “Reducing register ports using delayed write-back queues and operand pre-fetch”, International Supercomputing Conference. June, 2003. San Francisco, California. pp. 172-182. [pdf]
  • G. Gao, T. Mudge.“Special Issue on Compilers, Architecture, and Synthesis for Embedded Systems”, ACM Transactions on Embedded Computer Systems. vol 2, no 2. 2003. [pdf]

2002

  • K. Flautner, T. Mudge. “Vertigo: Automatic performance-setting for Linux”, The 5th Operating Systems Design and Implementation. December, 2002. pp. 105-116. [pdf]
  • S. Martin, K. Flautner, D. Blaauw, T. Mudge. “Combined dynamic voltage scaling and adaptive body biasing for lower power microprocessors under dynamic workloads”, International Conference of Computer Aided Design. November, 2002. San Jose, California. pp. 721-725. [ICCAD Ten Year Retrospective Most Infulential Paper Award][pdf]
  • N. Kim, K. Flautner, D. Blaauw, T. Mudge. “Drowsy instruction caches: Leakage power reduction using dynamic voltage scaling and cache sub-bank prediction”, 35th Annual IEEE/ACM Symposium on Microarchitecture. November, 2002. pp. 219-230. [pdf]
  • K. Flautner, S. Reinhardt, T. Mudge. “Automatic performance setting for dynamic voltage scaling”, ACM Journal on Wireless Networks. vol 8, no 5. September, 2002. pp. 507-520. [pdf]
  • N. Kim, T. Austin, T. Mudge. “Low-energy data cache using sign compression and cache line bisection”, 2nd Annual Workshop on Memory Performance Issues held in conjunction with the 29th Annual International Symposium on Computer Architecture. May, 2002. Anchorage, Alaska. [pdf]
  • K. Flautner, N. Kim, S. Martin, D. Blaauw, T. Mudge.“Drowsy Caches: Simple techniques for reducing leakage power”, The 29th Annual International Symposium on Computer Architecture. May, 2002. Anchorage, Alaska. pp. 148-157. [pdf]
  • D. Blaauw, S. Martin, T. Mudge, K. Flautner. “Leakage current reduction in VLSI systems”, Journal of Circuits, Systems, and Computers. vol 11, no 6. 2002. pp. 621-636.

2001

  • M. Guthaus, J. Ringenberg, D. Ernst, T. Austin, T. Mudge, R. Brown. “MiBench: A free, commercially representative embedded benchmark suite”, IEEE 4th Annual Workshop on Workload Characterization, held in conjunction with The 34th Annual IEEE/ACM Symposium on Microarchitecture. December, 2001. Austin, Texas. pp. 3-14. [pdf]
  • V. Cuppu, B. Jacob, B. Davis, T. Mudge. “High-performance DRAMs in workstation environments”, IEEE Transactions on Computers. vol 50, no 11. November, 2001. pp. 1133-1153. [pdf]
  • K. Flautner, S. Reinhardt, T. Mudge. “Automatic performance setting for dynamic voltage scaling”, 7th Annual International Conference On Mobile Computing and Networking. July, 2001. Rome, Italy. pp. 260-271. [pdf]
  • A. Eden, J. Ringenberg, S. Sparrow, T. Mudge. “Hybrid myths in branch prediction”, 5th World Multiconference on Systemics, Cybernetics and Informatics and the 7th Interenational Conference on Information Systems Analysis and Synthesis. vol XIV. July, 2001. Orlando, Florida. pp. XIV 74-81. [pdf]
  • M. Postiff, D. Greene, S. Raasch, T. Mudge. “Integrating superscalar processor components to implement register caching”, 15th ACM International Conference On Supercomputing. June, 2001. Sorrento, Italy. pp. 348-357. [pdf]
  • B. Jacob, T. Mudge, “Uniprocessor virtual memory without TLBs”, IEEE Transactions on Computers. vol 50, no 5. May, 2001. pp. 482-499. [pdf]
  • T. Mudge. “Power: A first class design constraint”, Computer. vol 34, no 4. April, 2001. pp. 52-57. [pdf]
  • N. Kim, T. Austin, T. Mudge, D. Grunwald. (Eds.) R. Melhem, R. Graybill. “Challenges for architectural level power modeling”, Kluwer Academic Publishers. Power Aware Computing. 2001. [pdf]

2000

  • M. Postiff, D. Greene, T. Mudge. “The store-load address table and speculative register promotion”, 33rd Annual IEEE/ACM Symposium Microarchitecture. December, 2000. pp. 235-244. [pdf]
  • T. Mudge. “Power: A first class design constraint for future architectures”, 7th International Conference on High Performance Computing. December, 2000. Bangalore, India. pp. 215-224.[pdf]
  • D. Van Campenhout, T. Mudge, J. P. Hayes. “Collection and analysis of microprocessor design errors”, IEEE Design and Test. vol 17, no 4. October, 2000. pp. 51-60. [pdf]
  • B. Davis, B. Jacob, T. Mudge. “The new DRAM interfaces: SDRAM, RDRAM and variants”, 3rd International Symposium on High Performance Computing. October, 2000. Tokyo, Japan. pp. 26-31. [pdf]
  • K. Flautner, R. Uhlig, S. Reinhardt, T. Mudge.“Thread-level parallelism and interactive performance of desktop applications”, 9th International Conference on Architectural Support for Programming Languages and Operating Systems. August, 2000. pp. 129-138. [pdf]
  • B. Davis, T. Mudge, B. Jacob. “DDR2 and low latency variants”, Memory Wall Workshop held in conjunction with the 26th Annual International Symposium on Computer Architecture. June, 2000. [pdf]
  • A. Eden, B. Joh, T. Mudge. “Web latency reduction via client-side prefetching”, 2000 IEEE International Symposium on Performance Analysis of Systems and Software. April, 2000. Austin, Texas. pp. 193-200. [pdf]
  • K. Flautner, R. Uhlig, S. Reinhardt, T. Mudge. “Thread level parallelism of desktop applications”, Workshop on Multi-threaded Execution, Architecture and Compilation held in conjunction with the 6th International Symposium on High Performance Computer Architecture. January, 2000. Toulouse.
  • C. Lefurgy, E. Piccininni, T. Mudge. “Reducing code size with run-time decompression”, The 6th International Symposium on High-Performance Computer Architecture. January, 2000. pp. 218-227. [pdf]

PREVIOUS CENTURY


1999

  • C. Lefurgy, E. Piccininni, T. Mudge. “Evaluation of a high performance code compression method”, The 32nd Annual Symposium on Microarchitecture. November, 1999. pp. 93-102. [pdf]
  • D. Van Campenhout, T. Mudge, J. P. Hayes. “Error simulation with conditional error models”, 4th IEEE International High Level Design Validation and Test Workshop. November, 1999. La Jolla, California. pp. 198-205. [pdf]
  • C. Lefurgy, T. Mudge. “Fast software-managed code decompression”, 2nd International Workshop on Compiler and Architecture Support for Embedded Systems. October, 1999. pp. 139-143. [pdf]
  • M. Postiff, G. Tyson, T. Mudge. “Performance limits of trace caches”, Journal of Instruction Level Parallelism. October, 1999. [pdf]
  • D. Van Campenhout, T. Mudge, J. P. Hayes. “High-level test generation for design verification of pipelined microprocessors”, The 36th ACM/IEEE Design Automation Conference. June, 1999. New Orleans, Louisiana. pp. 185-188. [pdf]
  • R. Uhlig, T. Mudge. (Eds.) G. Haring, C. Lindemann, M. Reiser. “Trace-driven memory simulation: A survey”, Springer-Verlag. Performance Evaluation: Origins and Directions. 1999. pp. 97-139. [Abridged from ACM Computing Surveys, vol. 29, no, 2, June 1997, pp. 128-170.]
  • D. Van Campenhout, T. Mudge, K. Sakallah. “Timing verification of sequential dynamic circuits”, IEEE Transactions on Computer-Aided Design. vol 18, no 5. May, 1999. pp. 645-658. [pdf]
  • V. Cuppu, B. Jacob, B. Davis, T. Mudge. “A performance comparison of contemporary DRAM architectures”, The 26th Annual International Symposium on Computer Architecture. May, 1999. pp. 222-233. [pdf]
  • H. Al-Asaad, J. Hayes, T. Mudge. “Modeling and detecting control errors in microprocessors”, International IEEE Conference on DYnamic CONtrol Systems. 1999. [pdf]
  • K. Flautner, G. Tyson, T. Mudge. “MIRVSim: a high level simulator integrated with the MIRV compiler”, Computer Architecture News. vol 27, no 1. March, 1999. pp. 43-46.  [Condensed from Proc. 3rd Workshop on Interaction Between Compilers and Computer Architecture (INTERACT-3) at the 8th Int. Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS-VIII), Oct. 1998] [pdf]

1998

  • A. Eden, T. Mudge. “The YAGS branch predictor”, 31st Annual IEEE/ACM Symposium on Microarchitecture. December, 1998. pp. 69-77. [pdf]
  • C. Lefurgy, T. Mudge. “Code compression for DSP”, 2nd International Workshop on Compiler and Architecture Support for Embedded Systems. vol 3, no 4. December, 1998. [pdf]
  • D. Van Campenhout, T. Mudge, J. P. Hayes. “High-level test generation for design verification of pipelined microprocessors”, 3rd IEEE International High Level Design Validation and Test Workshop. November, 1998. La Jolla, California. pp. 1-8. [pdf]
  • M. Postiff, D. Greene, G. Tyson, T. Mudge. “The limits of instructions level parallelism in SPEC95 applications”, Computer Architecture News. vol 27, no 1. March, 1999. pp. 31-34. [Condensed from Proc. 3rd Workshop on Interaction Between Compilers and Computer Architecture (INTERACT-3) at the 8th Int. Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS-VIII), Oct. 1998] [pdf]
  • K. Flautner, T. Mudge. “Introspective computers”, Wild and Crazy Ideas Session held in conjunction with The 8th International Conference on Architectural Support for Programming Languages and Operating Systems. October, 1998. [pdf]
  • D. Burger, G. Tyson, T. Austin, J. Smith, T. Mudge.“Alcohol Content vs. Flavor: A Case Study”, Zymurgy Magazine. October, 1998.
  • D. Van Campenhout, T. Mudge, J. P. Hayes. “Evaluation of design error models for verification testing of microprocessors”, IEEE 1st International Workshop on Microprocessor Test and Verification. October, 1998. Washington DC. [pdf]
  • D. Van Campenhout, H. Al-Asaad, J. P. Hayes, T. Mudge, R. Brown. “High-level design verification of microprocessors via error modeling”, ACM Transactions on Design Automation of Electronic Systems. vol 3, no 4. October, 1998. pp. 581-599. [pdf]
  • B. Jacob, T. Mudge. “A look at several memory management units, TLB-refill mechanisms, and page table organizations”, 8th International Conference on Architectural Support for Programming Languages and Operating Systems. October, 1998. San Jose, California. pp. 295-306. [pdf]
  • K. Flautner, G. Tyson, T. Mudge. “MIRVSim: a high level simulator integrated with the MIRV compiler”, 3rd Workshop on Interaction Between Compilers and Computer Architecture held in conjunction with the 8th International Conference on Architectural Support for Programming Languages and Operating Systems. October, 1998. [pdf]
  • B. Jacob, T. Mudge. “Virtual memory in contemporary microprocessors”, Micro. vol 18, no 4. July, 1998. pp. 60-75. [pdf]
  • C-C. Lee, I-C. Chen, T. Mudge. “Design and performance evaluation of global history dynamic branch predictors”, World Multiconference on Systemics Cybernetics and Informatics, and, The 4th International Conference on Information Systems Analysis and Synthesis. vol 2. July, 1998. Orlando, Florida. pp. 664-671. [pdf]
  • B. Jacob, T. Mudge. “Virtual memory: Issues of implementation”, Computer. vol 31, no 6. June, 1998. pp. 33-43. [pdf]
  • R. Brown, B. Bernhardt, M. LaMacchia, J. Abrokwah, P. Parakh, T. Basso, S. Gold, S. Stetson, C. Gauthier, D. Foster, B. Crawforth, T. McQuire, K. Sakallah, R. Lomax, T. Mudge. “Overview of complementary GaAs technology for high-speed VLSI circuits”, IEEE Transactions on VLSI. vol 6, no 1. March, 1998. pp. 47-51. [pdf]

1997

  • C. Lefurgy, P. Bird, I-C. Cheng, T. Mudge. “Improving code density using compression techniques”, 30th Annual IEEE/ACM Symposium on Microarchitecture. December, 1997. pp. 194-203. [pdf]
  • C. Lee, I. Chen, T. Mudge. “The bi-mode branch predictor”, 30th Annual IEEE/ACM Symposium on Microarchitecture. December, 1997. pp. 4-13. [pdf]
  • M. Kelley, M. Postiff, T. Strong, R. Brown, T. Mudge. “A complementary GaAs 32-bit multiply-accumulate unit”, 31st Asilomar Conference on Signals, Systems, and Computers. November, 1997. pp. 1507-1511. [pdf]
  • H. Al-Asaad, D. Van Campenhout, J. Hayes, T. Mudge. “High-level design verification of microprocessors via error modeling”, IEEE International Workshop on High Level Design Validation and Test. November, 1997. pp. 194-201. [pdf]
  • I-C. Chen, C-C. Lee, T. Mudge. “Instruction prefetching using branch prediction information”, International Conference on Computer Design 97. October, 1997. pp. 593-601. [pdf]
  • O. Olukotun, T. Mudge, R. Brown. “Multilevel performance optimization of pipelined caches”, IEEE Transactions on Computers. vol 46, no 10. October, 1997. pp. 1093-1102. [pdf]
  • I-C. Chen, C-C. Lee, M. Postiff, T. Mudge. “Design optimization for high-speed per-address two-level branch predictors”, International Conference on Computer Design 97. October, 1997. pp. 88-96. [pdf]
  • J. Dundas, T. Mudge. “Improving data cache performance by pre-executing instructions under a cache miss”, 1997 ACM International Conference on Supercomputing. July, 1997. pp. 68-75. [pdf]
  • R. Uhlig, T. Mudge. “Trace-driven memory simulation: A survey”, ACM Computing Surveys. vol 29, no 2. June, 1997. pp. 128-170. [pdf]
  • B. Davis, C. Gauthier, P. Parakh, T. Basso, C. Lefurgy, R. Brown, T. Mudge. “Impact of MCMs on high performance processors”, ASME Advances in Electronic Packaging 97. vol 1. June, 1997. pp. 863-868. [pdf]
  • B. L. Jacob, T. Mudge. “Software-managed address translation”, 3rd Symposium on High Performance Computer Architecture. February, 1997. San Antonio, Texas. pp. 156-167. [pdf]
  • R. Uhlig, D. Nagle, T. Mudge, S. Sechrest. “Trap-driven memory simulation with Tapeworm II”, ACM Transactions on Modeling and Computer Simulation. vol 7, no 1. January, 1997. pp. 7-41. [pdf]

1996

  • T. Mudge. “Strategic directions in computer architecture”, ACM Computing Surveys. vol 28, no 4. December, 1996. pp. 671-678. [Also available online to Surveys subscribers via the URL http://www.acm.org/pubs/contents/journals/surveys/1996-28/#4] [pdf]
  • D. Van Campenhout, T. Mudge, K. Sakallah. “Timing verification of sequential domino circuits”, International Conference on CAD. December, 1996. pp. 127-132. [pdf]
  • J. Pierce, T. Mudge. “Wrong path instruction prefetching”, 29th Annual IEEE/ACM Symposium on Microarchitecture. December, 1996. pp. 165-175. [pdf]
  • R. Brown, T. Basso, P. Parakh, S. Gold, C. Gauthier, R. Lomax, T. Mudge. “Complementary GaAs technology for a GHz microprocessor”, Tech. Digest of the GaAsIC Symposium. November, 1996. pp. 313-316. [pdf]
  • B. L. Jacob, T. Mudge. “Support for nomadism in a global environment”, Workshop on Object Replication and Mobile Computing. October, 1996. San Jose, California. [pdf]
  • I-C. Cheng, J. Coffey, T. Mudge. “Analysis of Branch Prediction via Data Compression”, 7th International Conference on Architectural Support for Programming Languages and Operating Systems. October, 1996. pp. 128-137. [pdf]
  • B. Jacob, P. Chen, S. Silverman, T. Mudge. “An analytical model for designing memory hierarchies”, IEEE Transactions on Computers. vol 45, no 10. October, 1996. pp. 1180-1194. [pdf]
  • D. Van Campenhout, T. Mudge, K. Sakallah. “Timing verification of sequential domino circuits”, TechCon 96. September, 1996. [Available as an electronic document to members of Semiconductor Research Corp.] [pdf]
  • B. L. Jacob, T. Mudge. “The trading function in action”, 7th ACM SIGOPS European Workshop. September, 1996. Connemara, Ireland. pp. 241-247. [pdf]
  • R. Brown, J. Hayes, T. Mudge. “Rapid prototyping & evaluation of high-performance computers”, Conference on Experimental Research in Computer Systems, NSF Experimental Systems. June, 1996. Washington DC. pp. 159-168. [pdf]
  • M. Golden, T. Mudge. “A comparison of two common pipeline structures”, Institution of Electrical Engineers Proc.-E, Computers and Digital Techniques. vol 143, no 3. May, 1996. [pdf]
  • T. Mudge. “Position paper: NSF Workshop on Critical Issues”, Computer Architecture Research. May, 1996. [Electronic document]
  • S. Sechrest, C-C. Lee, T. Mudge. “Correlation and aliasing in dynamic branch predictors”, The 23rd Annual International Symposium on Computer Architecture. May, 1996. pp. 22-32. [pdf]
  • T. Mudge. “Panel report: “How can computer architecture researchers avoid becoming the society for irreproducible results?””, Computer Architecture News. vol 24, no 1. March, 1996. pp. 1-5. [pdf]
  • T. Mudge, P. Bird. “An Instruction Stream Compression Technique”, Department of Electrical Engineering and Computer Science, The University of Michigan, CSE-TR-319-96. November, 1996. [pdf]
  • D. Van Campenhout, T. Mudge, K. Sakallah. “Modeling Domino Logic for Static Timing Analysis”, Department of Electrical Engineering and Computer Science, The University of Michigan, CSE-TR-295-96. June 1996. [pdf]

1995

  • S. Sechrest, C-C. Lee, T. Mudge. “The role of adaptivity in two-level branch prediction”, 28th Annual IEEE/ACM Symposium on Microarchitecture. December, 1995. pp. 264-270. [pdf]
  • D. Van Campenhout, T. Mudge. Timing Analysis of Digital Systems with Gated Clocks,” Department of Electrical and Computer Engineering, The University of Michigan, CSE-TR-257-95, August 1995 [pdf]
  • T. Mudge, I. Chen, J. Coffey. “Limits to Branch Prediction”, Department of Electrical Engineering and Computer Science, The University of Michigan, CSE-TR-282-96. January 1996. [pdf]
  • T. J. Stanley, T. Mudge. “A parallel genetic algorithm for multi-objective microprocessor design”, 6th International Conference on Genetic Algorithms. July, 1995. pp. 597-604. [pdf]
  • K. Sakallah, T. Burks, T. Mudge. “Critical paths in circuits with level-sensitive latches”, IEEE Transactions on VLSI Systems. vol 3, no 2. June, 1995. pp. 273-291. [pdf]
  • R. Uhlig, D. Nagle, T. Mudge, S. Sechrest, J. Emer. “Instruction fetching: Coping with code bloat”, The 22nd Annual International Symposium on Computer Architecture. June, 1995. pp. 345-356. [pdf]
  • B. Davis, T. Mudge. “A Verilog preprocessor for representing datapath components”, 4th International Verilog HDL Conference. March, 1995. pp. 90-98. [pdf]
  • B. Jacob, T. Mudge. “Notes on Calculating Computer Performance”, Department of Electrical and Computer Engineering, The University of Michigan, CSE-TR-231-95, March 1995.[pdf]
  • T. J. Stanley, T. N. Mudge. “A systematic approach to multi-objective computer architecture optimization”, 1995 Conference on Advanced Research in VLSI. March, 1995. pp. 286-300. [pdf]
  • J. Pierce, M. D. Smith, T. Mudge. (Eds.) T. M. Conte, C. E. Gimarc. “Instrumentation tools”, Kluwer Academic Publishers. Fast Simulation of Computer Architectures. 1995. pp. 47-86. [pdf]

1994

  •  D. Nagle, R. Uhlig, T. Stanley, T. Mudge, S. Sechrest and R. Brown. Design trade-offs for software-managed TLBs. ACM Trans. Computer Systems, vol. 12, no. 3, Aug. 1994, pp. 175-205. [pdf]
  •  Michael Golden and Trevor Mudge. A comparison between two pipeline organizations. 27th Ann. IEEE/ACM Symp. Microarchitecture (MICRO-27), Dec. 1994, pp. 153-161. [pdf]
  •  R. Uhlig, D. Nagle, T. Mudge, and S. Sechrest. Trap-driven simulation with Tapeworm II. 6th Int. Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS-VI), Oct. 1994, pp. 132-144. [pdf]
  •  M. Upton, T. Huff, T. Mudge, and R. Brown. Resource allocation in a high clock rate microprocessor. 6th Int. Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS-VI), Oct. 1994, pp. 98-109. [pdf]
  •  D. Nagle, R. Uhlig, T. Mudge, and S. Sechrest. Kernel-based memory simulation. ACM SIGMETRICS Conf. Measurement and Modeling of Computer Systems, May 1994, pp. 286-287. [pdf]
  •  Jim Pierce and Trevor Mudge. The effect of speculative execution on cache performance. IPPS 94, Int. Parallel Processing Symp., Cancun Mexico, Apr. 1994, pp. 172-179. [pdf]
  •  D. Nagle, R. Uhlig, T. Mudge, and S. Sechrest. Optimal allocation of on-chip memory for multiple-API operating systems. Proc. of the 21st Ann. Int. Symp. Computer Architecture, Apr. 1994, pp. 358-369. [pdf]
  •  Jim Pierce and Trevor Mudge. IDtrace – A tracing tool for i486 simulation. MASCOTS, Jan.- Feb. 1994, pp. 419-420. [pdf]

1993

  •  T. Stanley, M. Upton, P. Sherhart, T. Mudge, R. B. Brown. A microarchitectural performance evaluation of a 3.2 GB/s microprocessor bus. The Ann. ACM/IEEE Int. Symp. Microarchitecture, MICRO-26, Austin, TX, Dec. 1993, pp. 31-40. [pdf]
  •  R. Brown, M. Upton, A. Chandna, T. Huff, T. Mudge, and R. Oettel. Gallium arsenide process evaluation based on a RISC microprocessor example. IEEE Jour. Solid-State Circuits, vol. 28, no. 10, Oct. 1993, pp. 1030-1037. [pdf]
  •  K. Sakallah, T. Mudge, T. Burks, and E. Davidson. Synchronization of pipelines. IEEE Trans. CAD of IC’s and Systems, vol. 12, no. 8, Aug. 1993, pp. 1132-1146. [pdf]
  •  T. Huff, M. Upton, T. Mudge, R. Brown. The Aurora project. Record of Hot Chips V, Aug. 1993, pp. 3.2.1 – 3.2.12. [pdf]
  •  D. Nagle, R. Uhlig, T. Stanley, T. Mudge, S. Sechrest and R. Brown. Design tradeoffs for software-managed TLBs. Proc. of the 20th Ann. Int. Symp. Computer Architecture, May 1993, pp.27-38. [pdf]
  •  A. Kayssi, T. Mudge, and K. Sakallah. The impact of signal transition time on path delay computation. IEEE Trans. Circuits and Systems II: Analog and Digital Signal Processing, vol. 40, no. 5, May 1993, pp. 302-309. [pdf]
  •  T. Huff, M. Upton, P. Sherhart, P. Barker, R. McVay, T. Stanley, R. Brown, R. Lomax, T. Mudge, and K. Sakallah. A high performance GaAs microprocessor. Proc. IEEE Laser and Optics Soc. Sarnoff Symp., Princeton, Mar. 1993, no page no. [pdf]
  •  M. Upton, T. Huff, P. Sherhart, P. Barker, R. McVay, T. Stanley, R. Brown, R. Lomax, T. Mudge, and K. Sakallah. A 160,000 transistor GaAs microprocessor. Int. Solid-State Circuits Conf. Digest of Technical Papers, vol. 36, Feb. 1993, pp. 92-93. [pdf]

1992

  •  R. Brown, P. Barker, A. Chandna, T. Huff, R. Lomax, T. Mudge, K. Sakallah, P.J. Sherhart, R. Uhlig, and M. Upton. GaAs RISC processors. GaAs IC Symp., Miami, Oct. 1992, pp. 81-84. [pdf]
  •  R. Brown, A. Chandna, T. Huff, R. Lomax, T. Mudge, R.Oettel, and M. Upton. Compound semiconductor device requirements for VLSI. Proc. 19th Int. Symp. GaAs and Related Compounds, (Institute of Physics Conf. Series No. 129), Karuizawa, Japan, Oct. 1992, pp. 857-862. [pdf]
  •  T. Burks, K. Sakallah, and T. Mudge. Critical paths through level-sensitive latches. Int. Conf. CAD, Nov. 1992, pp. 137-141. [pdf]
  •  R. Brown, A. Chandna, T. Hoy, T. Huff, D. Johnson, R. Lomax, T. Mudge, D. Nagle, O. Olukotun, K. Sakallah, R. Uhlig, and M. Upton. Synthesis and verification of a GaAs microprocessor from a Verilog hardware description. Proc. Open Verilog Int. User Group Meeting, Mar. 1992, pp. 85-93. [pdf]
  •  T. Burks, K. Sakallah, and T. Mudge. Multi-phase retiming using minTc. TAU 92: 1992 ACM/SIGDA W/shop Timing Issues in the Specification and Synthesis of Digital Systems, Princeton Univ., Mar. 1992, 10 pp. [pdf]
  •  O. Olukotun, T. Mudge, and R. Brown. Performance optimization of pipelined primary caches. Proc. 19th Ann. Int. Symp. Computer Architecture, May 1992, pp. 181-190. [pdf]
  •  A. Kayssi, K. Sakallah, R. Brown, R. Lomax, T. Mudge, and T. Huff. Impact of MCMs on system performance optimization. 1992 IEEE Int. Symp. Circuits and Systems, vol. 2 of 6, San Diego, CA, May 1992, pp. 919-922. [pdf]
  •  R. Clapp and T. Mudge. Parallel language constructs for efficient parallel processing. Proc. Hawaii Int. Conf. System Sciences, Jan. 1992, pp. 230-241. [pdf]
  •  K. Sakallah, T. Mudge, and O. Olukotun. Analysis and design of latch-controlled synchronous digital circuits. IEEE Trans. CAD of ICs and Systems, vol. 11, no. 3, Mar. 1992, pp. 322-333. [pdf]
  •  A. Ladd, T. Mudge, and O. Olukotun. Measuring process migration effects using an MP simulator. Scalable Shared Memory Multiprocessors, Ed. M. Dubois and S. Thakkar, Kluwer Academic Publ., 1992, pp. 97-129. [pdf]

1991

  •  K. Sakallah, T. Mudge, T. Burks, and E. Davidson. Optimal clocking of circular pipelines. Proc. Int. Conf. Computer Design: VLSI in Computers and Processors, Oct. 1991, pp. 642-646. [pdf]
  •  A. Kayssi, K. Sakallah, R. Brown, R. Lomax, T. Mudge, and T. Huff. Impact of MCMs on system performance. 1991 Multichip Module Workshop, University of California, Santa Cruz, March 1991, pp. 58-65. [pdf]
  •  O. Olukotun, T. Mudge, and R. Brown. Implementing a cache for a high-performance GaAs microprocessor. Proc. of the 18th Ann. Int. Symp. Computer Architecture, May 1991, pp. 138-147. [pdf]
  •  O. Olukotun, R. Brown, R. Lomax, T. Mudge, and K. Sakallah. Multilevel optimization in the design of a high-performance GaAs microcomputer. IEEE Jour. Solid-State Circuits, vol. 16, no. 5, May 1991, pp. 763-767. [pdf]
  •  T. Mudge, R. Brown, W. Birmingham, J. Dykstra, A. Kayssi, R. Lomax, O. Olukotun, and K. Sakallah. The design of a micro-supercomputer. Computer, Jan. 1991, pp. 57-64. [pdf]
  •  T. Mudge, R. Brown, W. Birmingham, J. Dykstra, A. Kayssi, R. Lomax, O. Olukotun, and K. Sakallah. The design of a GaAs micro-supercomputer. Proc. Hawaii Int. Conf. System Sciences, Jan. 1991, pp. 421-432. [pdf]

1990

  • K. Sakallah, T. Mudge, and O. Olukotun. checkTc and minTc: Timing verification and optimal clocking of synchronous digital circuits. Proc. of IEEE Int. Conf. Computer-Aided Design, Nov. 1990, pp. 552-555. [pdf]
  •  K. Sakallah, T. Mudge, and O. Olukotun. Optimal clocking of synchronous systems. TAU 90: 1990 ACM Int. Workshop Timing Issues in the Specification and Synthesis of Digital Systems, Aug. 1990 pp. 21. [pdf]
  •  O. Olukotun and T. Mudge. Hierarchical gate array routing on a hypercube multiprocessor. Jour. of Parallel and Distributed Computing, 1990, pp. 313-324. [pdf]
  •  K. Sakallah, T. Mudge, and O. Olukotun. Analysis and design of latch-controlled synchronous digital circuits. Proc. of the 27-th ACM/IEEE Design Automation Conf., June 1990, pp. 111-117. Nominated for best paper. [pdf]
  •  R. Clapp, T. Mudge, and J. Smith. Performance of parallel loops using alternative cache consistency protocols on a non-bus multiprocessor. Cache and Interconnect Architectures in Multiprocessors, Ed. M. Dubois and S. Thakkar, Kluwer Academic Publ., 1990, pp. 131-152. [pdf]
  •  R. Clapp and T. Mudge. ADA performance issues. Ada Letters, vol. X, no. 3, Winter 1990, Chapters 1, 2, 3, 4, 5, 6, and 8. [pdf]
  •  R. Clapp, T. Mudge, and D. Winsor. Cache coherence requirements for interprocess rendezvous. Int. Jour. of Parallel Programming, vol. 9, no. 1, Jan. 1990, pp. 31-51. [pdf]
  •  R. Volz, T. Mudge, G. Linstrom. Report On The Embedded AI Languages Workshop, The University of Michigan, Ann Arbor MI, January 1990, pp. 34. [pdf]

1989

  •  J. Hayes and T. Mudge. Hypercube supercomputers. Proc. of the IEEE, vol. 77, no. 12, Dec. 1989, pp. 18291841. [pdf]
  •  G. Buzzard and T. Mudge. Short-latency routing for hypercube multiprocessors. Proc. 4th Conf. Hypercubes, Concurrent Computers & Applications, vol. I, Mar. 1989, pp. 285-291. [pdf]
  •  R. Clapp and T. Mudge. A parallel language for a hypercube multiprocessor. Proc. 4th Conf. Hypercubes, Concurrent Computers & Applications, vol. I, Mar. 1989, pp. 515522. [pdf]
  •  P. Gottschalk, J. Turney, and T. Mudge. Efficient recognition of partially visible objects using a logarithmic complexity matching technique. The Int. Journal of Robotics Research, vol. 8, no. 6, Dec. 1989, pp. 110130. [pdf]
  •  R. Clapp and T. Mudge. Ada on a hypercube. Ada Letters, Mar.- Apr. 1989, pp. 118 – 128. [pdf]
  •  R. Volz, T. Mudge, G. Buzzard, and P. Krishnan. Translation and execution of distributed Ada programs: Is it still Ada? IEEE Trans. Software Engineering, vol. 15, no. 3, Mar. 1989, pp. 281292. [pdf]

1988

  •  R. Volz, T. Mudge, and A. Naylor. Wanted: A new generation of software manufacturing. Standards and Information Technology and Industrial Control, (Eds.) N. Malagardis and T. Williams, North-Holland, Amsterdam 1988, pp. 153168. [pdf]
  •  P. Gottschalk and T. Mudge. Efficient encoding of local shape: Features for 3-d object recognition. Proc. of the 1988 SPIE Cambridge Symp. Optical and Optoelectronic Engineering Intelligent Robots and Computer Vision: Seventh in a Series, SPIE, Cambridge MA, Nov.1988. [pdf]
  •  R. Clapp and T. Mudge. Ada on a hypercube. Proc. 3rd Int. Conf. Hypercube Concurrent Computers & Applications, Jan. 1988, pp. 399-408. [pdf]
  •  G. Buzzard and T. Mudge. High performance hypercube communications. Proc. 3rd Int. Conf. Hypercube Concurrent Computers & Applications, Jan. 1988, pp. 600-609. [pdf]
  •  T. Abdelrahman and T. Mudge. Parallel branch and bound algorithms. Proc. 3rd Int. Conf. Hypercube Concurrent Computers & Applications, Jan. 1988, pp. 1492-1499. [pdf]
  •  D. Winsor and T. Mudge. Analysis of bus hierarchies for multiprocessor. Proc. of the 15th Ann. Int. Symp. Computer Architecture, May 1988, pp. 100-107. [pdf]

1987

  •  T. Abdel-Rahman and T. Mudge. Parallel branch-and-bound algorithms on hypercube multiprocessors. (Abstract) 3rd SIAM Conf. Parallel Processing for Scientific Computing, Dec.1987. [pdf]
  •  T. Mudge. Units of distribution for distributed Ada. Proc. of the Int. Workshop Real-Time Ada Issues, Ada UK and SIGAda, Devon, England, May 1987, also appears in Ada Letters, vol. VII, no. 6, Fall 1987. pp. 64-66. [pdf]
  •  W. Martin, T. Wan, T. Abdel-Rahman, and T. Mudge. Monte Carlo photon transport on shared memory and distributed memory parallel processors. The Int. Journal of Supercomputer Applications, vol. 1, no. 3, Fall 1987, pp. 57-74. [pdf]
  •  T. Mudge, H. Al-Sadoun, and B. Makrucki. A memory interference model for multi-processors based on semi-Markov processes. Institution of Electrical Engineers Proc. E, Computers and Digital Techniques, vol. 134, Part E, no. 4, July 1987, pp. 203-214. [pdf]
  •  D. Winsor and T. Mudge. Crosspoint cache architectures. Proc. of the 1987 Int. Conf. Parallel Processing, Aug. 1987, pp. 266-269. [pdf]
  •  T. Mudge, J. Hayes, and D. Winsor. Multiple bus architectures. Computer, June 1987, pp. 42-48. [pdf]
  •  R. Volz and T. Mudge. Instruction level timing mechanisms for accurate real-time task scheduling. IEEE Trans. Computers, vol. C-36, no. 8, Aug. 1987, pp. 988-993. [pdf]
  •  T. Mudge. An Analysis of hypercube architectures for image pattern recognition algorithms. Proc. of the Society of Photo-optical Instrumentation Engineers Optoelectronics and Laser Applications in Science and Engineering, Image Pattern Recognition Algorithm Implementations, Techniques, and Technology: Critical Review of Technology, SPIE vol. 755, Los Angeles CA, Jan. 1987, pp. 71-83. [pdf]
  •  P. Gottschalk, J. Turney, and T. Mudge. Two-dimensional partially visible object recognition using efficient multidimensional range queries. Proc. of the 1987 Int. Conf. Robotics and Automation, Apr. 1987, pp. 380-386. [pdf]
  •  J. Han, R. Volz, and T. Mudge. Range image segmentation and surface parameter extraction for 3-D object recognition of industrial parts. Proc. of the 1987 Int. Conf. Robotics and Automation, Apr. 1987, pp. 1582-1589. [pdf]
  •  J. Hayes, R. Jain, W. Martin, T. Mudge, L. Scott, K. Shin, and Q. Stout. Hypercube computer research at The University of Michigan. Hypercube Multiprocessors 1987, (Ed.) M. Heath, Society for Industrial and Applied Mathematics, Proc. of the 1986 Conf. Hypercube Multiprocessors, pp. 382-394. [pdf]
  •  T. Mudge, G. Buzzard, and T. Abdel-Rahman. A high performance operating system for the NCUBE. Hypercube Multiprocessors 1987. (Ed.) M. Heath, Society for Industrial and Applied Mathematics, Proc. of the 1986 Conf. Hypercube Multiprocessors, pp. 90-99. [pdf]
  •  W. Martin, T-C. Wan, D. Poland, T. Mudge, and T. Abdel-Rahman. Monte Carlo photon transport on the NCUBE. Hypercube Multiprocessors 1987, (Ed. M. Heath), Society for Industrial and Applied Mathematics, Proc. of the 1986 Conf. Hypercube Multiprocessors, pp. 454-463. [pdf]
  •  O. Olukotun and T. Mudge. A preliminary investigation into parallel routing on a hypercube computer. Proc. of the 24-th Design Automation Conf., Miami Beach, FL, June 1987, pp. 814-820. [pdf]
  •  R. Volz and T. Mudge. Timing issues in the distributed execution of Ada programs. IEEE Trans. Computers, vol. C-36, no. 4, Apr. 1987, pp. 449-459. [pdf]
  •  T. Mudge and T. Abdel-Rahman. Architectures for robot vision. Specialized Computer Architectures for Robotics and Automation, (Ed.) J. Graham, Gordon and Breach Science Publ., 1987, pp. 103-149. [pdf]
  •  T. Mudge, J. Turney, and R. Volz. Automatic generation of salient features for the recognition of partially occluded parts. Robotica, vol. 5, 1987, pp. 117-127. [pdf]
  •  T. Mudge and T. Abdel-Rahman. Vision algorithms for hypercube machines. Jour. of Parallel and Distributed Computing, 79-94 (1987). [pdf]

1986

  •  R. Clapp, T. Mudge, and R. Volz. Solutions to the N queens problem using tasking in Ada. SIGPLAN Notices, vol. 21, no. 12, Dec. 1986, pp. 99-110. [pdf]
  •  R. Volz and T. Mudge. Instruction level mechanisms for accurate real-time task scheduling. Proc. of the 1986 Real-time Systems Symp., Dec. 1986, pp. 209-215. [pdf]
  •  R. Volz, T. Mudge, G. Buzzard, and P. Krishnan. Translation and execution of distributed Ada programs: Is it still Ada? Proc. of the Society of Photo-optical Instrumentation Engineers Cambridge Symp. Advances in Intelligent Robotics Systems, SPIE vol. 727, Cambridge MA, Oct. 1986. [pdf]
  •  T. Mudge, J. Hayes, G. Buzzard and D. Winsor. Analysis of multiple-bus interconnection networks. Jour. of Parallel and Distributed Computing, 328-343 (1986). [pdf]
  •  J. Hayes, T. Mudge, Q. Stout, S. Colley, and J. Palmer. A microprocessor-based hypercube supercomputer. IEEE MICRO, Oct. 1986, pp. 6-17. [Best IEEE MICRO article of 1986.] Also to appear in, Multi-Microprocessors, (Ed.) A. Gupta, IEEE Computer Society Press, 1987, pp. 250-260. [pdf]
  •  J. Hayes, T. Mudge, Q. Stout, S. Colley, and J. Palmer. Architectures of a hypercube supercomputer. Proc. of the 1986 Int. Conf. Parallel Processing, Aug. 1986, pp. 653-660. [pdf]
  •  R. Clapp, L. Duchesneau, R. Volz, T. Mudge, and T. Schultze. Toward real-time performance benchmarks for Ada. Comm. of the ACM, vol. 29, no. 8, Aug. 1986, pp. 760-778. [pdf]
  •  J. Turney, T. Mudge and R. Volz. Solving the bin of parts problem. Proc. of Vision ’86, a Machine Vision Association of the SME Conf. and Exposition, Detroit, MI, June 1986, pp. 4-21 thru 4-38. [pdf]
  •  T. Mudge. The next generation of hypercube computers. Proc. of the ARO Workshop Future Directions in Computer Architecture and Software, May 1986, pp. 273-275. [pdf]
  •  R. Volz, T. Mudge, A. Naylor, and B. Brosgol. Ada in a manufacturing environment. Proc. of the Control Engineering Conf. and Exposition, Rosemont, IL, May 1986, pp. 433-440. [pdf]
  •  C. Antonelli, R. Volz and T. Mudge. Hierarchical decomposition and simulation of manufacturing cells using Ada. Simulation, 46:4, Apr. 1986, pp. 141-152. [pdf]

1985

  •  R. Dolezal, T. Mudge, J. Turney, and R. Volz. Determining the pose of an object. Proc. of the Society of Photo-optical Instrumentation Engineers 2nd Int. Symp. Completed Vision for Robotics, SPIE vol. 595, Cannes, France, Dec. 1985, pp. 68-71. [pdf]
  •  T. Mudge. Vision algorithms for hypercube machines. Proc. of the IEEE Workshop Computer Architecture for Pattern Analysis and Image Database Management, Nov., 1985, pp. 225-230. [pdf]
  •  T. Mudge and Humoud B. Al-Sadoun. A semi-Markov model for the performance of multiple-bus systems. IEEE Trans. Computers, vol. C-34, no. 10, Oct. 1985, pp. 934-942. [pdf]
  •  J. Turney, T. Mudge, and R. Volz. Recognizing partially occluded parts. IEEE Trans. Pattern Analysis and Machine Intelligence, vol. PAMI-7, no. 4, July 1985, pp. 410-421. [pdf]
  •  T. Mudge and Humoud B. Al-Sadoun. A semi-Markov model for the performance of multiple-bus systems. Proc. of the 1985 Int. Conf. Parallel Processing, Aug. 1985, pp. 521- 530. [pdf]
  •  Humoud B. Al-Sadoun, O. Olukotun, and T. Mudge. Interconnecting off-the-shelf microprocessors. Proc. of the National Computer Conf., (AFIPS Conf. Proc. Vol. 54), July 1985, pp. 175-181. [pdf]
  •  R. Volz, T. Mudge, A. Naylor and J. Mayer. Some problems in distributing real-time Ada programs across machines. Ada in use, Proc. of the 1985 Int. Ada Conf., (Eds.) J. Barnes and G. Fischer, May 1985, pp. 72-84. [pdf]
  •  J. Turney, T. Mudge, and R. Volz. Recognizing partially hidden objects. Proc. of the IEEE Int. Conf. Robotics and Automation, Mar. 1985, pp. 48-54. [pdf]
  •  G. Buzzard and T. Mudge. Object-based computing and the Ada programming language. Computer, Mar. 1985, pp. 11-19. Also in: Object Oriented Computing, (Ed.) G. Peterson, IEEE Computer Society Press 1987, pp. 115-123. [pdf]
  •  R. Volz, T. Mudge, and J. Mayer. Some problems in distributing real time Ada programs across machines. (Abstract) IEEE Computer Society Technical Committee Real-Time Systems Newsletter, Feb. 1985. (Presented at the IEEE Second Workshop Real-Time Operating Systems Nov. 1984). [pdf]
  •  P. Leonard and T. Mudge. System design for local neighborhood processing. Proc. of the Society of Photo-optical Instrumentation Engineers Los Angeles Symp. Algorithms for Image Processing, Jan. 1985. [pdf]

1984

  •  T. Mudge and J. Turney. Unifying robot arm control. IEEE Trans. Industry Applications, vol. IA-20, no. 6, Nov./Dec. 1984, pp. 1554-1563. [pdf]
  •  R. Volz, T. Mudge, and D. Gal. Using Ada as a programming language for robot-based manufacturing cells. IEEE Trans. Systems, Man and Cybernetics, vol. SMC-14, no. 6, Nov./Dec. 1984, pp. 863-878. Also in: Object Oriented Computing, (Ed.) G. Peterson, IEEE Computer Society Press 1987, pp. 99-114. Also in: Control and Programming in Advanced Manufacturing, (Ed.) K. Rathmill, IFS Ltd. (Springer-Verlag), 1988: Bedford, UK. [pdf]
  •  J. Turney, T. Mudge, and R. Volz. Recognizing partially hidden objects. Proc. of the Society of Photo-optical Instrumentation Engineers Cambridge Symp. Intelligent Robots and Computer Vision, Nov. 1984, pp. 108-113. [pdf]
  •  C. Antonelli, R. Volz, and T. Mudge. Hierarchical decomposition and simulation of manufacturing cells. Proc. of the 1984 Winter Simulation Conf., Nov. 1984, pp. 415-423. [pdf]
  •  T. Mudge and Humoud B. Al-Sadoun. Memory interference models with variable interconnection time. IEEE Trans. Computers, vol. C-33, no. 11, Nov. 1984, pp. 1033-1038. [pdf]
  •  R. Volz, T. Mudge, A. Woo, J. Turney, and D. Gal. CAD, robot programming and Ada. Robotics and Artificial Intelligence, (Eds.) M. Brady, L. Gerhardt and H. Davidson, NATO Advanced Studies Institute Series, Series F: Computer and System Sciences, vol. 11, Berlin: Springer-Verlag, 1984, pp. 229-246. [pdf]
  •  R. Rutenbar, T. Mudge and D. Atkins. A class of cellular architectures to support physical design automation. IEEE Trans. CAD of IC’s and Systems, vol. CAD-3, no. 4, Oct. 1984, pp. 264-278. [pdf]
  •  T. Mudge, J. Hayes, G. Buzzard, and D. Winsor. Analysis of multiple-bus interconnection networks. Proc. of the 1984 Conf. Parallel Processing, Aug. 1984, pp. 228-235. Also in: Advanced Computer Architecture, Ed. D. Agrawal, IEEE Computer Society Press 1986, pp. 155-159. [pdf]
  •  R. Volz and T. Mudge. Robots are (nothing more than) abstract data types. Proc. of the SME Conf. Robotics Research: The Next 5 Years and Beyond, (The First World Conf. Robotics Research), Aug. 1984, MS84-493, 15 pp. [pdf]

1983

  •  T. Mudge, J. Turney, and R. Volz. Experiments in occluded parts recognition. Proc. of the Society of Photo-optical Instrumentation Engineers Cambridge Symp. Intelligent Robots, SPIE vol. 449 (part 2), Nov. 1983, pp. 719-725. [pdf]
  •  T. Mudge and T. Abdel-Rahman. Case study of a program for the recognition of occluded parts. Proc. of the 2nd Ann. IEEE Computer Society Workshop Computer Architecture for Pattern Analysis and Image Data Base Management, Pasadena, CA, Oct. 1983, pp. 56-60. [pdf]
  •  R. Rutenbar, T. Mudge, and D. Atkins. Wire routing experiments on a raster pipeline Subarray Machine, Proc. of the IEEE Int. Conf. CAD, Sep. 1983, pp. 135-136. [pdf]
  •  G. Buzzard and T. Mudge. Teaching assembly language programming with ZIP, a Z80 assembly language interpreter. IEEE Trans. Education, vol. E-26, no. 3, Aug. 1983, pp. 91-98. [pdf]
  •  T. Mudge and T. Abdel-Rahman. Efficiency of feature dependent algorithms for the parallel processing of images. Proc. of the Int. Conf. Parallel Processing, Aug. 1983, pp. 369-373. [pdf]
  •  J. Turney, T. Mudge, R. Volz, and M. Diamond. Experiments in occluded parts recognition using the generalized Hough transform. Proc. of the Conf. Artificial Intelligence, Oakland University, Rochester, MI, Apr. 1983. [pdf]
  •  R. Volz, T. Mudge, and D. Gal. Using Ada as a robot system programming language. Proc. of the 13th Int. Symp. Industrial Robots and Robots 7 Conf., Chicago, Apr. 1983, pp. 12-4212-57. [pdf]
  • T. Mudge, G. Buzzard, D. Verhaeghe, J. Hill, and D. Winsor. Object-based computer architectures. Proc. of the 1983 Conf. Information Sciences and Systems, The Johns Hopkins University, Mar. 1983, pp. 733-741. [pdf]
  •  C.S.G. Lee and T. Mudge. Advanced control for multirobot assembly systems. Proc. of the 10th Conf. Production Research and Technology, (NSF Grantees Conf.), Detroit, MI, Feb. 1983, pp. 129-135. [pdf]

1982

  •  T. Mudge and B. Makrucki. An approximate queueing model for packet switched multistage interconnection networks. Proc. of the 3rd Int. Conf. Distributed Computing Systems, Oct. 1982, pp. 556-562. [pdf]
  •  T. Mudge and J. Turney. Unifying robot arm control. Proc. of the 1982 Ann. Meeting of the Industry Applications Society, Oct. 1982, pp. 1315-1324. Also appears in: IEEE Trans. Industry Applications. [pdf]
  •  T. Mudge, R. Volz, and D. Atkins. Hardware/software transparency in robotics through object level design. Proc. of the Society of Photo-optical Instrumentation Engineers Technical Symp. West, SPIE vol. 360, Aug. 1982, pp. 216-223. [pdf]
  •  T. Mudge, E. Delp, L. Siegel, and H. Siegel. Image coding using the multimicroprocessor system PASM. Proc. of the IEEE Computer Society Conf. Pattern Recognition and Image Processing, June 1982, pp. 200-205. [pdf]
  •  C.S.G. Lee, T. Mudge, and J.L. Turney. Hierarchical control structure using special purpose processors for the control of robot arms. Proc. of the IEEE Computer Society Conf. Pattern Recognition and Image Processing, June 1982, pp. 634-640. Also appears in: Tutorial Robotics, C.S.G. Lee, R. Gonzalez, and K. Fu, IEEE Computer Society Press, 1983, pp. 181-187. [pdf]
  •  T. Mudge, R. Rutenbar, R. Lougheed, and D. Atkins. Cellular image processing techniques for VLSI circuit layout validation and routing. Proc. of the 19th Ann. Design Automation Conf., June 1982, pp. 537-543. Also appears in: Selected Reprints VLSI Technologies and Computer Graphics, Henry Fuchs, IEEE Computer Society Press, 1983, pp. 484- 490. [pdf]
  •  C.S.G. Lee, M. Chung, T. Mudge, and J. Turney. On the control of mechanical manipulators. Proc. of the 6th Int. Federation of Automatic Control Symp. Identification and System Parameter Estimation, June 1982, pp. 1454-1459. [pdf]
  •  E. Delp, T. Mudge, L. Siegel, and H. Siegel. Parallel processing for computer vision. Proc. of the Society of Photo-optical Instrumentation Engineers Technical Symp. East, vol. 336 (Robot Vision), May 1982, pp. 161-167. [pdf]
  •  T. Mudge and B. Makrucki. Probabilistic analysis of a crossbar switch. Proc. of the 9th Ann. Int. Symp. Computer Architecture, Apr. 1982, pp. 311-319. [pdf]
  •  T. Mudge and B. Makrucki. Analysis of multistage networks with unique interconnection paths. Proc. of the 14th Southeastern Symp. System Theory, Apr. 1982, pp. 7-11. [pdf]
  •  T. Mudge and B. Makrucki. Analysis of a multiport memory. Proc. of the 16th Ann. Conf. Information Sciences and Systems, Princeton University, Mar. 1982, pp. 639-643. [pdf missing]
  •  T. Mudge and E. Delp. Special purpose architectures for computer vision. Proc. of the 15th Hawaii Int. Conf. Systems Science, Jan. 1982, pp. 378-387. [pdf missing]

1981

  • T. Mudge. Special purpose VLSI processors for industrial robots. (text accompanying invited panel participation), Proc. of the IEEE Computer Society’s 5th Int. Computer Software and Applications Conf., Nov. 1981, pp. 270-271. [pdf]
  • L. Siegel, E. Delp, T. Mudge, and H. Siegel. Block truncation coding on PASM. Proc. of the 19th Ann. Allerton Conf. Communications, Control, and Computing, Oct. 1981, pp. 891-900. [pdf]
  •  T. Mudge. Teaching assembly language using an assembly language interpreter. Proc. of the 1981 National Conf. of the American Society for Engineering Education, University of Southern California, June 1981, pp. 22-27. [Winner of the Curtis Award for best paper in the Computers in Education Division.] [pdf]
  •  J. Turney and T. Mudge. VLSI implementation of a numerical processor for robotics. Proc. of the 27th Int. Instrumentation Symp., Indianapolis, Indiana, Apr. 1981, pp. 169-175. [Received a Best Paper Award.] Also presented at the Instrument Society of America Anaheim Conf., Oct., 1981. [pdf]
  •  T. Mudge, R. Lougheed, and W. Teel. Cellular image processing techniques for checking VLSI circuit layouts. Proc.of the 15-th Ann. Conf. on Information Sciences and Systems, The Johns Hopkins University, Mar.1981, pp. 315-320. [pdf]
  •  T. Mudge, R. Lougheed, and W. Teel. Cellular image processing techniques for checking VLSI circuit layouts. (Abstract) Abstracts of the 1981 ACM Computer Science Conf., St. Louis, Feb. 1981, p. 29. [pdf]
  •  T. Mudge. A course sequence in microprocessor-based digital systems design. IEEE Trans. Education, vol. E-24, no. 1, Feb. 1981, pp. 14-21. [Honorable mention for runner-up to the best paper published in the Trans. in the year 1981.] [pdf]

1980

  •  T. Mudge. Design language for asynchronous multiprocessor systems. (Abstract) Report on the Workshop on Self-Timed Systems, MIT, May 1980, p. 13. [pdf]

1979

  •  T. Mudge. A distributed operating system machine. Proc. of the Louisiana Computer Exposition on Distributed Systems Based on Mini and Micro Computers, University of Southwestern Louisiana, Mar. 1979, pp. 143-166. [pdf]
  • T. Mudge. A distributed operating system machine. Proc. of the 1979 Conf. Information Sciences and Systems, The Johns Hopkins University, Mar. 1979, pp. 472-477. [pdf]

1978

  •  T. Mudge. A computer architecture for parallel processing. Proc. of the 16th Ann. Allerton Conf. Communications, Control, and Computing, Oct. 1978, p. 596. [pdf]
  •  T. Mudge. A data driven computer architecture. Proc. of the 1978 Conf. Information Sciences and Systems, The Johns Hopkins University, March 1978, pp. 365-370. [pdf]

1976

  •  J. Smith and T. Mudge. Characteristics of some augmented Petri nets. Proc. of the 14th Ann. Allerton Conf. Circuit and System Theory, Oct. 1976, pp. 606-615. [pdf]

1975

  •  T. Mudge. Specifying a design language for digital systems. Proc. of the 13th Ann. Allerton Conf. Circuit and System Theory, Oct. 1975, pp. 906-915. [pdf]