2009 | ||
---|---|---|
98 | Joseph Gebis, Leonid Oliker, John Shalf, Samuel Williams, Katherine A. Yelick: Improving Memory Subsystem Performance Using ViVA: Virtual Vector Architecture. ARCS 2009: 146-158 | |
97 | Filip Blagojevic, Costin Iancu, Katherine A. Yelick, Matthew Curtis-Maury, Dimitrios S. Nikolopoulos, Benjamin Rose: Scheduling dynamic parallelism on accelerators. Conf. Computing Frontiers 2009: 161-170 | |
96 | Rajesh Nishtala, Paul Hargrove, Dan Bonachea, Katherine A. Yelick: Scaling communication-intensive applications on BlueGene/P using one-sided communication and overlap. IPDPS 2009: 1-12 | |
95 | Katherine A. Yelick: Ten ways to waste a parallel computer. ISCA 2009: 1 | |
94 | Kamesh Madduri, Samuel Williams, Stéphane Ethier, Leonid Oliker, John Shalf, Erich Strohmaier, Katherine A. Yelick: Memory-efficient optimization of Gyrokinetic particle-to-grid interpolation for multicore processors. SC 2009 | |
93 | Marghoob Mohiyuddin, Mark Hoemmen, James Demmel, Katherine A. Yelick: Minimizing communication in sparse matrix solvers. SC 2009 | |
92 | Krste Asanovic, Rastislav Bodík, James Demmel, Tony Keaveny, Kurt Keutzer, John Kubiatowicz, Nelson Morgan, David A. Patterson, Koushik Sen, John Wawrzynek, David Wessel, Katherine A. Yelick: A view of the parallel computing landscape. Commun. ACM 52(10): 56-67 (2009) | |
91 | Katherine A. Yelick: Technical perspective - Abstraction for parallelism. Commun. ACM 52(9): 88 (2009) | |
90 | Samuel Williams, Jonathan Carter, Leonid Oliker, John Shalf, Katherine A. Yelick: Optimization of a lattice Boltzmann computation on state-of-the-art multicore platforms. J. Parallel Distrib. Comput. 69(9): 762-777 (2009) | |
89 | Samuel Williams, Leonid Oliker, Richard W. Vuduc, John Shalf, Katherine A. Yelick, James Demmel: Optimization of sparse matrix-vector multiplication on emerging multicore platforms. Parallel Computing 35(3): 178-194 (2009) | |
2008 | ||
88 | Costin Iancu, Wei Chen, Katherine A. Yelick: Performance portable optimizations for loops containing communication operations. ICS 2008: 266-276 | |
87 | Katherine A. Yelick: Programming models for petascale to exascale. IPDPS 2008: 1 | |
86 | James Demmel, Mark Hoemmen, Marghoob Mohiyuddin, Katherine A. Yelick: Avoiding communication in sparse matrix computations. IPDPS 2008: 1-12 | |
85 | Samuel Williams, Jonathan Carter, Leonid Oliker, John Shalf, Katherine A. Yelick: Lattice Boltzmann simulation optimization on leading multicore platforms. IPDPS 2008: 1-14 | |
84 | Kaushik Datta, Mark Murphy, Vasily Volkov, Samuel Williams, Jonathan Carter, Leonid Oliker, David A. Patterson, John Shalf, Katherine A. Yelick: Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures. SC 2008: 4 | |
83 | Jack Dongarra, Robert Graybill, William Harrod, Robert F. Lucas, Ewing L. Lusk, Piotr Luszczek, Janice McMahon, Allan Snavely, Jeffrey S. Vetter, Katherine A. Yelick, Sadaf R. Alam, Roy L. Campbell, Laura Carrington, Tzu-Yi Chen, Omid Khalili, Jeremy S. Meredith, Mustafa M. Tikir: DARPA's HPCS Program - History, Models, Tools, Languages. Advances in Computers 72: 1-100 (2008) | |
2007 | ||
82 | Katherine A. Yelick, John M. Mellor-Crummey: Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2007, San Jose, California, USA, March 14-17, 2007. ACM 2007 | |
81 | Wei-Yu Chen, Dan Bonachea, Costin Iancu, Katherine A. Yelick: Automatic nonblocking communication for partitioned global address space programs. ICS 2007: 158-167 | |
80 | Jimmy Su, Katherine A. Yelick: Automatic Communication Performance Debugging in PGAS Languages. LCPC 2007: 232-245 | |
79 | Costin Iancu, Wei Chen, Katherine A. Yelick: Performance Portable Optimizations for Loops Containing Communication Operations. PACT 2007: 411 | |
78 | Katherine A. Yelick, Dan Bonachea, Wei-Yu Chen, Phillip Colella, Kaushik Datta, Jason Duell, Susan L. Graham, Paul Hargrove, Paul N. Hilfinger, Parry Husbands, Costin Iancu, Amir Kamil, Rajesh Nishtala, Jimmy Su, Michael L. Welcome, Tong Wen: Productivity and performance using partitioned global address space languages. PASCO 2007: 24-32 | |
77 | Amir Kamil, Katherine A. Yelick: Hierarchical Pointer Analysis for Distributed Programs. SAS 2007: 281-297 | |
76 | Parry Husbands, Katherine A. Yelick: Multi-threading and one-sided communication in parallel LU factorization. SC 2007: 31 | |
75 | Samuel Williams, Leonid Oliker, Richard W. Vuduc, John Shalf, Katherine A. Yelick, James Demmel: Optimization of sparse matrix-vector multiplication on emerging multicore platforms. SC 2007: 38 | |
74 | Tong Wen, Jimmy Su, Phillip Colella, Katherine A. Yelick, Noel Keen: An adaptive mesh refinement benchmark for modern parallel programming languages. SC 2007: 40 | |
73 | Shivali Agarwal, Rajkishore Barik, Dan Bonachea, Vivek Sarkar, R. K. Shyamasundar, Katherine A. Yelick: Deadlock-free scheduling of X10 computations with bounded resources. SPAA 2007: 229-240 | |
72 | Rajesh Nishtala, Richard W. Vuduc, James Demmel, Katherine A. Yelick: When cache blocking of sparse matrix vector multiply works and why. Appl. Algebra Eng. Commun. Comput. 18(3): 297-311 (2007) | |
71 | Rajesh Nishtala, Richard W. Vuduc, James Demmel, Katherine A. Yelick: When cache blocking of sparse matrix vector multiply works and why. Appl. Algebra Eng. Commun. Comput. 18(3): 297-311 (2007) | |
70 | Katherine A. Yelick, Paul N. Hilfinger, Susan L. Graham, Dan Bonachea, Jimmy Su, Amir Kamil, Kaushik Datta, Phillip Colella, Tong Wen: Parallel Languages and Compilers: Perspective From the Titanium Experience. IJHPCA 21(3): 266-290 (2007) | |
69 | Samuel Williams, John Shalf, Leonid Oliker, Shoaib Kamil, Parry Husbands, Katherine A. Yelick: Scientific Computing Kernels on the Cell Processor. International Journal of Parallel Programming 35(3): 263-298 (2007) | |
68 | Ewing L. Lusk, Katherine A. Yelick: Languages for High-Productivity Computing: the DARPA HPCS Language Project. Parallel Processing Letters 17(1): 89-102 (2007) | |
2006 | ||
67 | Samuel Williams, John Shalf, Leonid Oliker, Shoaib Kamil, Parry Husbands, Katherine A. Yelick: The potential of the cell processor for scientific computing. Conf. Computing Frontiers 2006: 9-20 | |
66 | Hongzhang Shan, Ji Qiang, Erich Strohmaier, Katherine A. Yelick: Performance Analysis of a High Energy Colliding Beam Simulation Code on Four HPC Architectures. ICPP 2006: 237-244 | |
65 | Christian Bell, Dan Bonachea, Rajesh Nishtala, Katherine A. Yelick: Optimizing bandwidth limited problems using one-sided communication and overlap. IPDPS 2006 | |
64 | Katherine A. Yelick: Compilation Techniques for Partitioned Global Address Space Languages. LCPC 2006: 1 | |
63 | Shoaib Kamil, Kaushik Datta, Samuel Williams, Leonid Oliker, John Shalf, Katherine A. Yelick: Implicit and explicit optimizations for stencil computations. Memory System Performance and Correctness 2006: 51-60 | |
62 | Katherine A. Yelick: Performance Advantages of Partitioned Global Address Space Languages. PVM/MPI 2006: 6 | |
61 | Dan Bonachea, Paul Hargrove, Rajesh Nishtala, Michael L. Welcome, Katherine A. Yelick: Poster reception - Optimized collectives for PGAS languages with one-sided communication. SC 2006: 143 | |
60 | Hongzhang Shan, Erich Strohmaier, Ji Qiang, David H. Bailey, Katherine A. Yelick: Particles and continuum - Performance modeling and optimization of a high energy colliding beam simulation code. SC 2006: 97 | |
59 | Edward Givelberg, Katherine A. Yelick: Distributed Immersed Boundary Simulation in Titanium. SIAM J. Scientific Computing 28(4): 1361-1378 (2006) | |
2005 | ||
58 | Keshav Pingali, Katherine A. Yelick, Andrew S. Grimshaw: Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2005, June 15-17, 2005, Chicago, IL, USA. ACM 2005 | |
57 | Wei-Yu Chen, Costin Iancu, Katherine A. Yelick: Communication Optimizations for Fine-Grained UPC Applications. IEEE PACT 2005: 267-278 | |
56 | Jimmy Su, Katherine A. Yelick: Automatic Support for Irregular Computations in a High-Level Language. IPDPS 2005 | |
55 | Amir Kamil, Katherine A. Yelick: Concurrency Analysis for Parallel Programs with Textually Aligned Barriers. LCPC 2005: 185-199 | |
54 | Kaushik Datta, Dan Bonachea, Katherine A. Yelick: Titanium Performance and Potential: An NPB Experimental Study. LCPC 2005: 200-214 | |
53 | Shoaib Kamil, Parry Husbands, Leonid Oliker, John Shalf, Katherine A. Yelick: Impact of modern memory subsystems on cache optimizations for stencil computations. Memory System Performance 2005: 36-43 | |
52 | Katherine A. Yelick: Language innovations for HPCS. PPOPP 2005: 119 | |
51 | Amir Kamil, Jimmy Su, Katherine A. Yelick: Making Sequential Consistency Practical in Titanium. SC 2005: 15 | |
2004 | ||
50 | Benjamin C. Lee, Richard W. Vuduc, James Demmel, Katherine A. Yelick: Performance Models for Evaluation and Automatic Tuning of Symmetric Sparse Matrix-Vector Multiply. ICPP 2004: 169-176 | |
49 | Christian Bell, Wei-Yu Chen, Dan Bonachea, Katherine A. Yelick: Evaluating support for global address space languages on the Cray X1. ICS 2004: 184-195 | |
48 | Jimmy Su, Katherine A. Yelick: Array Prefetching for Irregular Array Accesses in Titanium. IPDPS 2004 | |
47 | Gorden Griem, Leonid Oliker, John Shalf, Katherine A. Yelick: Identifying Performance Bottlenecks on Modern Microarchitectures Using an Adaptable Probe. IPDPS 2004 | |
46 | Eun-Jin Im, Ismail Bustany, Cleve Ashcraft, James Demmel, Katherine A. Yelick: Performance Tuning of Matrix Triple Products Based on Matrix Structure. PARA 2004: 740-746 | |
45 | Eun-Jin Im, Katherine A. Yelick, Richard W. Vuduc: Sparsity: Optimization Framework for Sparse Matrix Kernels. IJHPCA 18(1): 135-158 (2004) | |
44 | Katherine A. Yelick: Special Issue on Automatic Performance Tuning. IJHPCA 18(1): 19 (2004) | |
2003 | ||
43 | Parry Husbands, Costin Iancu, Katherine A. Yelick: A performance analysis of the Berkeley UPC compiler. ICS 2003: 63-73 | |
42 | Christian Bell, Dan Bonachea, Yannick Cote, Jason Duell, Paul Hargrove, Parry Husbands, Costin Iancu, Michael L. Welcome, Katherine A. Yelick: An Evaluation of Current High-Performance Networks. IPDPS 2003: 28 | |
41 | Rich Vuduc, Attila Gyulassy, James Demmel, Katherine A. Yelick: Memory Hierarchy Optimizations and Performance Bounds for Sparse A^T Ax. International Conference on Computational Science 2003: 705-714 | |
40 | Wei-Yu Chen, Arvind Krishnamurthy, Katherine A. Yelick: Polynomial-Time Algorithms for Enforcing Sequential Consistency in SPMD Programs with Arrays. LCPC 2003: 340-356 | |
39 | Ben Liblit, Alexander Aiken, Katherine A. Yelick: Type Systems for Distributed Data Sharing. SAS 2003: 273-294 | |
2002 | ||
38 | Brian R. Gaeke, Parry Husbands, Xiaoye S. Li, Leonid Oliker, Katherine A. Yelick, Rupak Biswas: Memory-Intensive Benchmarks: IRAM vs. Cache-Based Machines. IPDPS 2002 | |
37 | Rich Vuduc, James Demmel, Katherine A. Yelick, Shoaib Kamil, Rajesh Nishtala, Benjamin C. Lee: Performance optimizations and bounds for sparse matrix-vector multiply. SC 2002: 1-35 | |
36 | David L. Oppenheimer, Aaron B. Brown, James Beck, Daniel Hettena, Jon Kuroda, Noah Treuhaft, David A. Patterson, Katherine A. Yelick: ROC-1: Hardware Support for Recovery-Oriented Computing. IEEE Trans. Computers 51(2): 100-107 (2002) | |
2001 | ||
35 | Eun-Jin Im, Katherine A. Yelick: Optimizing Sparse Matrix Computations for Register Reuse in SPARSITY. International Conference on Computational Science (1) 2001: 127-136 | |
2000 | ||
34 | Thinh P. Q. Nguyen, Avideh Zakhor, Katherine A. Yelick: Performance Analysis of an H.263 Video Encoder for VIRAM. ICIP 2000 | |
33 | David Judd, Katherine A. Yelick, Christoforos E. Kozyrakis, David Martin, David A. Patterson: Exploiting On-Chip Memory Bandwidth in the VIRAM Compiler. Intelligent Memory Systems 2000: 122-134 | |
1999 | ||
32 | Remzi H. Arpaci-Dusseau, Eric Anderson, Noah Treuhaft, David E. Culler, Joseph M. Hellerstein, David A. Patterson, Katherine A. Yelick: Cluster I/O with River: Making the Fast Case Common. IOPADS 1999: 10-22 | |
31 | Eun-Jin Im, Katherine A. Yelick: Optimizing Sparse Matrix Vector Multiplication on SMP. PPSC 1999 | |
30 | Arvind Krishnamurthy, Alexander Aiken, Phillip Colella, David Gay, Susan L. Graham, Paul N. Hilfinger, Ben Liblit, Carleton Miyamoto, Geoff Pike, Luigi Semenzato, Katherine A. Yelick: Titanium: A High Performance Java Dialect. PPSC 1999 | |
1998 | ||
29 | Katherine A. Yelick, Luigi Semenzato, Geoff Pike, Carleton Miyamoto, Ben Liblit, Arvind Krishnamurthy, Paul N. Hilfinger, Susan L. Graham, David Gay, Phillip Colella, Alexander Aiken: Titanium: A High-performance Java Dialect. Concurrency - Practice and Experience 10(11-13): 825-836 (1998) | |
1997 | ||
28 | David A. Patterson, Krste Asanovic, Aaron B. Brown, Richard Fromm, Jason Golbus, Benjamin Gribstad, Kimberly Keeton, Christoforos E. Kozyrakis, David Martin, Stylianos Perissakis, Randi Thomas, Noah Treuhaft, Katherine A. Yelick: Intelligent RAM (IRAM): The Industrial Setting, Applications and Architectures. ICCD 1997: 2-7 | |
27 | Richard Fromm, Stylianos Perissakis, Neal Cardwell, Christoforos E. Kozyrakis, Bruce McGaughy, David A. Patterson, Thomas E. Anderson, Katherine A. Yelick: The Energy Efficiency of IRAM Architectures. ISCA 1997: 327-337 | |
26 | Christoforos E. Kozyrakis, Stylianos Perissakis, David A. Patterson, Thomas E. Anderson, Krste Asanovic, Neal Cardwell, Richard Fromm, Jason Golbus, Benjamin Gribstad, Kimberly Keeton, Randi Thomas, Noah Treuhaft, Katherine A. Yelick: Scalable Processors in the Billion-Transistor Era: IRAM. IEEE Computer 30(9): 75-78 (1997) | |
25 | Soumen Chakrabarti, James Demmel, Katherine A. Yelick: Models and Scheduling Algorithms for Mixed Data and Task Parallel Programs. J. Parallel Distrib. Comput. 47(2): 168-184 (1997) | |
1996 | ||
24 | Arvind Krishnamurthy, Klaus E. Schauser, Chris J. Scheiman, Randolph Y. Wang, David E. Culler, Katherine A. Yelick: Evaluation of Architectural Support for Global Address-Based Communication in Large-Scale Parallel Machines. ASPLOS 1996: 37-48 | |
23 | Steve G. Steinberg, Jun Yang, Katherine A. Yelick: Performance Modeling and Composition: A Case Study in Cell Simulation. IPPS 1996: 68-74 | |
22 | Katherine A. Yelick: Systems Support for Irregular Parallel Applications (Abstract). IRREGULAR 1996: 145 | |
21 | Arvind Krishnamurthy, Katherine A. Yelick: Analyses and Optimizations for Shared Address Space Programs. J. Parallel Distrib. Comput. 38(2): 130-144 (1996) | |
1995 | ||
20 | Chih-Po Wen, Katherine A. Yelick: Portable Runtime Support for Asynchronous Simulation. ICPP (2) 1995: 196-205 | |
19 | Remzi H. Arpaci-Dusseau, David E. Culler, Arvind Krishnamurthy, Steve G. Steinberg, Katherine A. Yelick: Empirical Evaluation of the CRAY-T3D: A Compiler Perspective. ISCA 1995: 320-331 | |
18 | Arvind Krishnamurthy, Katherine A. Yelick: Optimizing Parallel Programs with Explicit Synchronization. PLDI 1995: 196-204 | |
17 | Katherine A. Yelick, Chih-Po Wen, Soumen Chakrabarti, Etienne Deprit, Jeff A. Jones, Arvind Krishnamurthy: Portable Parallel Irregular Applications. PSLS 1995: 157-173 | |
16 | Jeff A. Jones, Katherine A. Yelick: Parallelizing the Phylogeny Problem. SC 1995 | |
15 | Soumen Chakrabarti, James Demmel, Katherine A. Yelick: Modeling the Benefits of Mixed Data and Task Parallelism. SPAA 1995: 74-83 | |
1994 | ||
14 | Arvind Krishnamurthy, Katherine A. Yelick: Optimizing Parallel SPMD Programs. LCPC 1994: 331-345 | |
13 | Soumen Chakrabarti, Katherine A. Yelick: Distributed Data Structures and Algorithms for Gröbner Basis Computation. Lisp and Symbolic Computation 7(2-3): 147-172 (1994) | |
1993 | ||
12 | Chih-Po Wen, Katherine A. Yelick: Parallel timing simulation on a distributed memory multiprocessor. ICCAD 1993: 130-135 | |
11 | Soumen Chakrabarti, Katherine A. Yelick: Implementing an Irregular Application on a Distributed Memory Multiprocessor. PPOPP 1993: 169-178 | |
10 | Soumen Chakrabarti, Katherine A. Yelick: On the Correctness of a Distributed Memory Gröbner Basis Algorithm. RTA 1993: 77-91 | |
9 | David E. Culler, Andrea C. Arpaci-Dusseau, Seth Copen Goldstein, Arvind Krishnamurthy, Steven Lumetta, Thorsten von Eicken, Katherine A. Yelick: Parallel programming in Split-C. SC 1993: 262-273 | |
8 | Geoffrey Fox, Sanjay Ranka, Michael L. Scott, Allen D. Malony, James C. Browne, Marina C. Chen, Alok N. Choudhary, Thomas Cheatham, Janice E. Cuny, Rudolf Eigenmann, Amr F. Fahmy, Ian T. Foster, Dennis Gannon, Tomasz Haupt, Carl Kesselman, Charles Koelbel, Wei Li, Monica S. Lam, Thomas J. LeBlanc, Jim Openshaw, David A. Padua, Constantine D. Polychronopoulos, Joel H. Saltz, Alan Sussman, Gil Weigand, Katherine A. Yelick: Common runtime support for high-performance parallel languages. SC 1993: 752-757 | |
1992 | ||
7 | Katherine A. Yelick, Stephen J. Garland: A Parallel Completion Procedure for Term Rewriting Systems. CADE 1992: 109-123 | |
6 | Katherine A. Yelick: Programming Models for Irregular Applications. SIGPLAN Workshop 1992: 28-31 | |
5 | Joseph L. Zachary, Katherine A. Yelick: Using Moded Type Systems to Support Abstraction in Logic Programs. Types in Logic Programming 1992: 229-243 | |
1990 | ||
4 | Katherine A. Yelick: Parallel Completion. Dagstuhl Seminar on Parallelization in Inference Systems 1990: 348 | |
1989 | ||
3 | Katherine A. Yelick, Joseph L. Zachary: Moded Type Systems for Logic Programming. POPL 1989: 116-124 | |
1987 | ||
2 | Katherine A. Yelick: Unification in Combinations of Collapse-Free Regular Theories. J. Symb. Comput. 3(1/2): 153-181 (1987) | |
1985 | ||
1 | Katherine A. Yelick: Combining Unification Algorithms for Confined Regular Equational Theories. RTA 1985: 365-380 |