 | M. Heinrich and E. Speight. "Providing Hardware DSM Performance at
Software DSM Cost". Cornell Computer Systems Lab Technical Report
CSL-TR-2000-1008, November 2000. (Postscript)
 | J. Gibson, R. Kunz, D. Ofelt, M. Horowitz, J. Hennessy, and M. Heinrich.
"FLASH vs. (Simulated) FLASH: Closing the Simulation Loop". In Proceedings
of the 9th International Conference on Architectural Support for Programming
Languages and Operating Systems (ASPLOS), pages 49-58, November 2000. (PDF) |
 | A. Chou, B. Chelf, D. Engler, and M. Heinrich. "Using Meta-Level
Compilation to Check FLASH Protocol Code". In Proceedings of the
9th International Conference on Architectural Support for Programming
Languages and Operating Systems (ASPLOS), pages 59--70, November 2000. (Postscript)
 | Evan Speight, Hazim Abdel-Shafi,
and John K. Bennett. WSDLite:
A Lightweight Alternative to Windows Sockets Direct Path. In Proceedings
of the 4th Usenix Windows Systems Symposium. Seattle, WA,
August, 2000. |
 | R. Manohar and M. Heinrich. "A Case For Asynchronous Active
Memories". ISCA 2000 Solving the Memory Wall Problem Workshop,
June 2000. (Postscript)
 | R. Manohar and M. Heinrich. "The Branch Processor Architecture".
Cornell Computer Systems Lab Technical Report CSL-TR-1999-1000,
November 1999. (Postscript)
 | Hazim Abdel-Shafi, Evan Speight,
and John K. Bennett. Efficient
User-Level Checkpointing and Thread Migration in Windows NT Clusters. In
Proceedings of the 3rd Usenix Windows NT Symposium.
Seattle, WA, July 1999. |
 | Evan Speight, Hazim Abdel-Shafi,
and John K. Bennett. Realizing
the Performance Potential of the Virtual Interface Architecture. In Proceedings
of the 13th ACM International Conference on Supercomputing (ICS).
Rhodes, Greece, June 1999. |
 | J. Hennessy, A. Gupta, and M. Heinrich, "Cache-Coherent Distributed
Shared Memory: Perspectives on Its Development and Future Challenges". Proceedings
of the IEEE, 87(3):418-429, Special Issue on Distributed Shared
Memory, March 1999. (Postscript)
 | M. Heinrich, R. Soundararajan, J. Hennessy, and A. Gupta, "A
Quantitatitve Analysis of the Performance and Scalability of Distributed
Shared Memory Cache Coherence Protocols". IEEE Transactions on
Computers, 48(2):205-217, Special Issue on Cache Memory and
Related Problems, February 1999. (PDF)
 | Evan Speight, Hazim Abdel-Shafi,
and John K. Bennett. An
Integrated Shared-Memory/Message Passing API for Cluster-Based
Multicomputing. In Proceedings of the Second International Conference
on Parallel and Distributed Computing and Networks (PDCN). Brisbane,
Australia, December 1998. |
 | M. Heinrich, "The
Performance and Scalability of Distributed Shared Memory Cache Coherence
Protocols". Ph.D. Dissertation, Stanford University,
October 1998.
 | R. Soundararajan, M. Heinrich, B. Verghese, et al. "Flexible Use of
Memory for Replication/Migration in Cache-Coherent DSM
Multiprocessors". In Proceedings of the 25th International
Symposium on Computer Architecture (ISCA), pages 342-355, June 1998.
(Postscript)
 | K. Olukotun, M. Heinrich, and D. Ofelt, "Digital System Simulation:
Methodologies and Examples". In Proceedings of the 35th Design
Automation Conference, pages 658-663, June 1998. (Postscript)
 | Evan Speight and John K. Bennett. Using
Multicast and Multithreading to Reduce Communication in Software DSM Systems.
In Proceedings of the Fourth Symposium on High Performance Architecture (HPCA).
Las Vegas, NV, February 1998. |
 | Evan Speight. Efficient
Runtime Support for Cluster-Based Distributed Shared Memory Multiprocessors.
Ph.D. Thesis, Rice University, August 1997. |
 | Evan Speight and John K. Bennett. Brazos:
A Third Generation DSM System. In Proceedings of the First Usenix
Windows NT Symposium. Seattle, WA, July 1997. |
 | M. Heinrich, D. Ofelt, M. Horowitz, and J. Hennessy,
"Hardware/Software Codesign of the Stanford FLASH Multiprocessor".
In Proceedings of the IEEE Special Issue on Hardware/Software Co-design,
Vol. 85, No. 3, March 1997. (Postscript)
 | John K. Bennett, Katherine E.
Fletcher, and Evan Speight. The
Performance Value of Shared Network Caches in Clustered Multiprocessor
Workstations. In Proceedings of the 16th International Conference on
Distributed Computing Systems (ICDCS). Hong Kong, 1996. |
 | M. Martonosi, D. Ofelt, and M. Heinrich, "Integrating
Performance Monitoring and Communication in Parallel Computers". In
ACM SIGMETRICS International Conference on Measurement and Modeling of
Computer Systems, pp. 138-147, May 1996. (Postscript)
 | J. Hennessy and M. Heinrich, "Hardware/Software Co-Design of
Processors: Concepts and Examples". In Hardware/Software Co-design,
edited by G. de Micheli and M. Sami, Dordecht; Boston: Kluwer Academic
Publishers, c. 1996. (Postscript)
 | C. Holt, M. Heinrich, J.P. Singh, et al., "The
Effects of Latency, Occupancy, and Bandwidth in Distributed Shared Memory
Multiprocessors". Stanford University Technical Report
CSL-TR-95-660, January 1995. (Postscript)
 | W. Evan Speight, Katherine E.
Fletcher, and John K. Bennett. Working
Set Requirements and Performance of Network Caches in Cluster-Based
Multiprocessors. Rice University ELEC TR 9414. 1994. |
 | Katherine E. Fletcher, W. Evan
Speight, and John K. Bennett: Techniques
for Reducing the Impact of Cache Inclusion in Shared Network Cache
Multiprocessors. Rice University ELEC TR 9413, 1994. |
 | M. Heinrich, J. Kuskin, D. Ofelt, et al., "The
Performance Impact of Flexibility in the Stanford FLASH Multiprocessor".
In Proceedings of the 6th International Conference on Architectural
Support for Programming Languages and Operating Systems (ASPLOS), pages
274-285, 1994. (Postscript)
 | Evan Speight. Paraview: Performance
Debugging of Shared Memory Programs. M.S. Thesis, Rice University, April
1994. |
 | J. Kuskin, D. Ofelt, M. Heinrich, et al., "The
Stanford FLASH Multiprocessor". In Proceedings of the 21st
International Symposium on Computer Architecture (ISCA), pages 302-313,
April 1994. (Postscript)
Reprinted in Selected Papers from 25 Years of ISCA, pages 485-496,
August 1998 |
 | John K. Bennett, Sandhya Dwarkadas,
Jay Greenwood, Evan Speight. Willow:
A Scalable Shared Memory Multiprocessor. In Proceedings of
SuperComputing '92. 1992. |
| | | | | | | | | | | | | |