{"id":10,"date":"2018-12-31T19:05:51","date_gmt":"2018-12-31T19:05:51","guid":{"rendered":"https:\/\/wordpress-courses1819.wolfware.ncsu.edu\/csc724-sprg-2019-yh55qbm\/?page_id=10"},"modified":"2025-04-18T21:52:08","modified_gmt":"2025-04-18T21:52:08","slug":"index","status":"publish","type":"page","link":"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/","title":{"rendered":"CSC 724: Advanced Distributed Systems"},"content":{"rendered":"<p style=\"text-align: center\"><strong>Spring 2025<br \/>Credits:<\/strong>\u00a0<strong>3<\/strong><br \/><strong>Meeting Times: Tuesday\/Thursday, 3:00pm &#8211; 4:15pm<\/strong><br \/><strong>Meeting Location: 1212 Engineering Building II<br \/>Assignment submission: <a href=\"https:\/\/moodle-courses2223.wolfware.ncsu.edu\/course\/view.php?id=6734\">Moodle<\/a><br \/>Message board: <a href=\"https:\/\/piazza.com\/ncsu\/spring2025\/csc724\">Piazza<\/a><br \/><\/strong><\/p>\n<h3>\u00a0<\/h3>\n<h3>Instructor Information<\/h3>\n<ul>\n<li><a href=\"http:\/\/www.csc.ncsu.edu\/faculty\/gu\/\">Xiaohui (Helen) Gu<\/a><\/li>\n<li>Office Hours: Tues\/Thurs 3:00pm &#8211; 4:15pm\u00a0 at EBII \u00a03274<\/li>\n<li>Email: xgu AT ncsu.edu<\/li>\n<\/ul>\n<h3>Teaching Assistants\/Graders<\/h3>\n<ul>\n<li>Tanvin Kalra (Grader)<\/li>\n<li>Email: tkalra AT ncsu.edu<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h3>Course Objectives<\/h3>\n<p>This course explores design and implementation principles in modern distributed systems. In particular, the course will emphasize recent techniques used by real-world distributed systems such as cloud systems, enterprise data centers, and peer-to-peer file sharing (e.g., BitTorrent). Students will learn the state of the art in distributed system architectures, algorithms, and performance evaluation methodologies. 
Topics include canonical distributed concepts such as remote procedure calls, distributed objects, replication, distributed system security, and consensus protocols, as well as recent distributed system technologies such as peer-to-peer, grid, autonomic computing, distributed massive data processing\/Google MapReduce, system machine learning, distributed system debugging, multi-core systems, and distributed virtualization. On completing this course, the student should be able to do the following:<\/p>\n<ul>\n<li>Identify research problems and challenges in distributed systems (assessed by review and presentation);<\/li>\n<li>List the state-of-the-art tools and techniques for addressing research problems and challenges in distributed systems (assessed by review and presentation);<\/li>\n<li>Develop and implement new ideas to solve open problems in distributed systems (assessed by project);<\/li>\n<li>Conduct technical reviews, technical writing, and technical presentations\u00a0(assessed by review, project, paper, presentation).<\/li>\n<\/ul>\n<h3>Textbooks<\/h3>\n<p>There are no assigned textbooks for this course. Topics will be covered in in-class lectures and through course notes made available on this web page.<br \/><strong>Links to the supplementary material in the form of research papers related to each topic are included in this syllabus [<a href=\"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/syllabus\">Course Syllabus<\/a>]. PDFs of most papers are available through the <a href=\"http:\/\/www.lib.ncsu.edu\/\">NCSU library web site<\/a>, which has full-text access to most recent ACM and IEEE journals and conferences. A number of supplemental distributed system textbooks are also available:<\/strong><br \/><a href=\"http:\/\/www.amazon.com\/Distributed-Systems-Concepts-Design-3rd\/dp\/0201619180\">Distributed Systems: Concepts and Design<\/a> (4th Edition), G. Coulouris, J. Dollimore, and T. 
Kindberg<br \/><strong><a href=\"http:\/\/www.amazon.com\/Distributed-Systems-2nd-Sape-Mullender\/dp\/0201624273\">Distributed Systems<\/a>\u00a0(2nd Edition), Sape Mullender<\/strong><br \/><a href=\"http:\/\/www.amazon.com\/Distributed-Systems-Principles-Paradigms-2nd\/dp\/0132392275\/ref=pd_bxgy_b_text_b\/002-0461668-4888807\">Distributed Systems: Principles and Paradigms<\/a>, Andrew S. Tanenbaum, Maarten van Steen<\/p>\n<h3>Course Description<\/h3>\n<p>Distributed systems have become the fundamental computing infrastructure for many important real-world applications such as Internet search engines, media streaming servers, online file sharing, information analytics, and scientific exploration. This course explores design and implementation principles in modern distributed systems. In particular, the course will emphasize recent techniques used by real-world distributed systems such as peer-to-peer file sharing (e.g., BitTorrent), enterprise data centers, and Internet search engines (e.g., Google). Students will learn the state of the art in distributed system architectures, algorithms, and performance evaluation methodologies. Topics include i) traditional distributed computing concepts (e.g., distributed objects, middleware, replication, distributed system security, and consensus protocols); and ii) emerging distributed system techniques such as peer-to-peer systems, massive data processing, Grid, and autonomic computing. Students will have opportunities not only to learn the common design methodology of many important distributed systems, but also to gain hands-on experience through project implementations. The majority of course materials will be drawn from classic papers and current state-of-the-art work. The instructor will lecture for the first half of the semester, and students will present papers and projects in the second half of the semester. 
Students will read and review papers ahead of time, participate in class discussions, present at least one research topic during the course, and do a term project individually or in a two-member team. Students will also write a paper describing their project (and review other students&#8217; papers) and present their work at the end of the course, in a &#8220;conference&#8221; format designed to give students an experience similar to that of participating in a professional conference.<\/p>\n<h3>Prerequisites<\/h3>\n<p>CSC501 or equivalent. Programming experience in C++ or Java in a Unix environment. If you are not sure whether you can attend this course, please consult the instructor.<\/p>\n<h3>Tentative Grading Policy<\/h3>\n<p><strong>Written reviews 20%, class participation 30% (presentation: 20%, discussion: 10%), project 50% (proposal write-up 5%, proposal presentation 5%, project mid-review presentation 5%, demo 15%, final presentation 10%, final write-up 10%)<\/strong><\/p>\n<h3>Late policy<\/h3>\n<p><strong>Lateness is calculated from the time recorded on the assignment email received by the instructor. 
Students will lose 25% for each 24-hour period they are late on a review, project, or paper.<\/strong><\/p>\n<h3>Paper Review<\/h3>\n<p>Review guidelines: Provide a one-paragraph summary of the paper, a paragraph on 2-3 strong points of the paper (i.e., why the paper should be accepted), a paragraph on 2-3 weak points of the paper (i.e., why the paper should be rejected), and, optionally, ideas for new research related to the work described in the paper.<\/p>\n<ul>\n<li><a href=\"http:\/\/www-cse.ucsd.edu\/users\/wgg\/CSE210\/howtoread.html\">How to read an engineering research paper<\/a>\u00a0by Bill Griswold<\/li>\n<li><a href=\"http:\/\/www1.cs.columbia.edu\/%7Ekaiser\/relatedwork.htm\">Suggested guidelines for finding &#8220;related work&#8221; for conference papers<\/a>\u00a0by Gail Kaiser<\/li>\n<li><a href=\"https:\/\/homes.cs.washington.edu\/~mernst\/advice\/review-technical-paper.html\">Reviewing a technical paper<\/a>\u00a0by Mike Ernst<\/li>\n<\/ul>\n<h3>Project<\/h3>\n<ul>\n<li><a href=\"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/suggested-term-project-topics\/\">Suggested Term Project Topics<\/a>.<\/li>\n<li>Course project development environment: Amazon AWS, Google Cloud, VCL<\/li>\n<\/ul>\n<p>Both the project proposal and the final report should follow typical paper requirements using the\u00a0<a href=\"http:\/\/www.acm.org\/sigs\/publications\/proceedings-templates\">ACM Double-Column Paper format<\/a>. The project proposal should include an abstract, introduction, proposed approaches, and related work. The final project report should be a full paper including an abstract, introduction, design and algorithms, experimental evaluation, related work, and conclusion. We will organize a mini-conference for the students to present their project work. 
The three best papers will be selected during the mini-conference.<\/p>\n<h3>Class Schedule (<i>Tentative<\/i>)<\/h3>\n<table border=\"1\" width=\"1256\">\n<tbody>\n<tr>\n<td width=\"2%\">\u00a0Week<\/td>\n<td width=\"5%\">\u00a0Date<\/td>\n<td width=\"15%\">Topic<\/td>\n<td width=\"50%\">Assigned\u00a0Readings<\/td>\n<td width=\"24%\">Assignments<\/td>\n<\/tr>\n<tr>\n<td colspan=\"1\" rowspan=\"2\">1<\/td>\n<td>1\/7<\/td>\n<td colspan=\"1\" rowspan=\"1\"><i><br \/><\/i><br \/>Introduction [<a href=\"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-content\/uploads\/sites\/77\/2025\/01\/lecture-intro.ppt\">slides<\/a>]<\/td>\n<td>\n<ul>\n<li>Chapter 1, Distributed Systems: Concepts and Design<\/li>\n<\/ul>\n<\/td>\n<td colspan=\"1\" rowspan=\"2\">Investigate your term project idea and prepare for it. A list of candidate project topics will also be provided in class. Talk to the instructor about your project idea, and talk to other students about forming a group of two or three members. Email the instructor to set up an appointment.\n<p><strong><b>1\/13 midnight:\u00a0<\/b>Review due<\/strong>\u00a0for<\/p>\n<ul>\n<li>L. 
Lamport, <a href=\"http:\/\/research.microsoft.com\/users\/lamport\/pubs\/time-clocks.pdf\">Time, clocks and the ordering of events in a distributed system<\/a>,\u00a0 Communications ACM 1978, and<\/li>\n<li>Chandy and Lamport, <a href=\"http:\/\/portal.acm.org\/citation.cfm?id=214456&amp;coll=GUIDE&amp;dl=GUIDE&amp;CFID=26963081&amp;CFTOKEN=98284324\">Distributed snapshots: determining global states of distributed systems<\/a>, ACM TOCS 1985.<\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td>1\/9<\/td>\n<td>Replication [<a href=\"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-content\/uploads\/sites\/77\/2025\/01\/lecture-replication.ppt\">slides<\/a>]<\/td>\n<td>\n<ul>\n<li>Chapter 14,\u00a0<a href=\"http:\/\/www.amazon.com\/Distributed-Systems-Concepts-Design-3rd\/dp\/0201619180\">Distributed Systems: Concepts and Design<\/a><\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td colspan=\"1\" rowspan=\"2\" width=\"2%\">2<\/td>\n<td width=\"5%\">1\/14<\/td>\n<td width=\"15%\">Project Testbed<\/td>\n<td width=\"50%\">\u00a0<\/td>\n<td colspan=\"1\" rowspan=\"2\" width=\"24%\">Investigate your term project idea and do preparation for it. Talk to the instructor about your project idea and talk to other students in forming a group if you would like to work in a group.\n<p><strong><b>1\/20\u00a0midnight:\u00a0<\/b>Review due<\/strong>\u00a0for<\/p>\n<ul>\n<li>Rowstron and P. Druschel, &#8220;<a href=\"https:\/\/www.cs.cornell.edu\/people\/egs\/615\/pastry.pdf\">Pastry: Scalable, distributed object location and routing for large-scale peer-to-peer systems<\/a>&#8220;.\u00a0 Middleware 2001, and<\/li>\n<li>Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, and Hari Balakrishnan, &#8220;<a href=\"http:\/\/pdos.csail.mit.edu\/papers\/chord:sigcomm01\/chord_sigcomm.pdf\">Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications<\/a>&#8220;, Proc. 
of SIGCOMM, 2001.<\/li>\n<\/ul>\n<p><b>Sunday midnight (1\/19): Paper presentation signup<\/b>\u00a0due. Please email the\u00a0TA\u00a0to bid on three papers from the list below, listing your choices in decreasing order of preference. You will be allocated one paper to present based on the FCFS policy and paper availability.<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>1\/16<\/td>\n<td>Project Testbed [<a href=\"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-content\/uploads\/sites\/77\/2025\/01\/lecture-research.pdf\">slides<\/a>]<\/td>\n<td>\u00a0<\/td>\n<\/tr>\n<tr>\n<td rowspan=\"2\" width=\"2%\">3<\/td>\n<td width=\"5%\">1\/21<\/td>\n<td width=\"15%\">\u00a0Project Testbed<\/td>\n<td width=\"50%\">\n<p>&nbsp;<\/p>\n<\/td>\n<td rowspan=\"2\" width=\"24%\"><strong><b>1\/27 midnight:\u00a0<\/b>Review due<\/strong>\u00a0for\n<ul>\n<li>I. Cohen and S. Zhang and M. Goldszmidt and J. Symons and T. Kelly and A. Fox,\u00a0<a href=\"https:\/\/dl.acm.org\/doi\/10.1145\/1095809.1095821\">Capturing, indexing, clustering, and retrieving system history<\/a>, Proc. of SOSP 2005, and<\/li>\n<li>I. Cohen and M. Goldszmidt and T. Kelly and J. Symons and J. S. Chase,\u00a0<a href=\"https:\/\/www.usenix.org\/legacy\/event\/osdi04\/tech\/full_papers\/cohen\/cohen.pdf\">Correlating Instrumentation Data to System States: A Building Block for Automated Diagnosis and Control<\/a>, Proc. of OSDI 2004.<\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">1\/23<\/td>\n<td width=\"15%\">Consensus Protocol<\/td>\n<td width=\"50%\">\n<ul>\n<li><a href=\"http:\/\/portal.acm.org\/citation.cfm?id=214121&amp;coll=portal&amp;dl=ACM&amp;CFID=11971489&amp;CFTOKEN=43811585\">Impossibility of distributed consensus with one faulty process<\/a>, Fischer, Lynch and Paterson, Journal ACM 1985.<\/li>\n<li><a href=\"http:\/\/research.microsoft.com\/users\/lamport\/pubs\/time-clocks.pdf\">Time, clocks and the ordering of events in a distributed system<\/a>, L. 
Lamport, Communications ACM 1978.<\/li>\n<li><a href=\"http:\/\/portal.acm.org\/citation.cfm?id=214456&amp;coll=GUIDE&amp;dl=GUIDE&amp;CFID=26963081&amp;CFTOKEN=98284324\">Distributed snapshots: determining global states of distributed systems<\/a>, Chandy and Lamport, ACM TOCS 1985.<\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td colspan=\"1\" rowspan=\"2\" width=\"2%\">4<\/td>\n<td width=\"5%\">1\/28<\/td>\n<td width=\"15%\">Consensus Protocol<\/td>\n<td width=\"50%\">\n<ul>\n<li><a href=\"http:\/\/portal.acm.org\/citation.cfm?id=214121&amp;coll=portal&amp;dl=ACM&amp;CFID=11971489&amp;CFTOKEN=43811585\">Impossibility of distributed consensus with one faulty process<\/a>, Fischer, Lynch and Paterson, Journal ACM 1985.<\/li>\n<li><a href=\"http:\/\/research.microsoft.com\/users\/lamport\/pubs\/time-clocks.pdf\">Time, clocks and the ordering of events in a distributed system<\/a>, L. Lamport, Communications ACM 1978.<\/li>\n<li><a href=\"http:\/\/portal.acm.org\/citation.cfm?id=214456&amp;coll=GUIDE&amp;dl=GUIDE&amp;CFID=26963081&amp;CFTOKEN=98284324\">Distributed snapshots: determining global states of distributed systems<\/a>, Chandy and Lamport, ACM TOCS 1985.<\/li>\n<\/ul>\n<\/td>\n<td colspan=\"1\" rowspan=\"2\" width=\"24%\">\u00a0<\/td>\n<\/tr>\n<tr>\n<td>1\/30<\/td>\n<td>Consensus Protocol<\/td>\n<td>\n<ul>\n<li><a href=\"http:\/\/portal.acm.org\/citation.cfm?id=214121&amp;coll=portal&amp;dl=ACM&amp;CFID=11971489&amp;CFTOKEN=43811585\">Impossibility of distributed consensus with one faulty process<\/a>, Fischer, Lynch and Paterson, Journal ACM 1985.<\/li>\n<li><a href=\"http:\/\/research.microsoft.com\/users\/lamport\/pubs\/time-clocks.pdf\">Time, clocks and the ordering of events in a distributed system<\/a>, L. 
Lamport, Communications ACM 1978.<\/li>\n<li><a href=\"http:\/\/portal.acm.org\/citation.cfm?id=214456&amp;coll=GUIDE&amp;dl=GUIDE&amp;CFID=26963081&amp;CFTOKEN=98284324\">Distributed snapshots: determining global states of distributed systems<\/a>, Chandy and Lamport, ACM TOCS 1985.<\/li>\n<li>\u00a0<\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td rowspan=\"2\" width=\"2%\">5<\/td>\n<td width=\"5%\">2\/4<\/td>\n<td width=\"15%\">Autonomic Computing [<a href=\"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-content\/uploads\/sites\/77\/2025\/01\/lecture-AC.ppt\">slides<\/a>]<\/td>\n<td width=\"50%\">\n<p>&nbsp;<\/p>\n<ul>\n<li>J. Kephart and D. Chess,\u00a0<a href=\"http:\/\/www.research.ibm.com\/autonomic\/research\/papers\/AC_Vision_Computer_Jan_2003.pdf\">The Vision of Autonomic Computing<\/a>, Computer Magazine, IEEE, 2003.<\/li>\n<li>Jeffrey O. Kephart: Research challenges of autonomic computing. ICSE 2005: 15-22.<\/li>\n<li>I. Cohen and M. Goldszmidt and T. Kelly and J. Symons and J. S. Chase,\u00a0<a href=\"http:\/\/www.ifp.uiuc.edu\/%7Eiracohen\/publications\/OSDI2004.pdf\">Correlating Instrumentation Data to System States: A Building Block for Automated Diagnosis and Control<\/a>, Proc. of OSDI, 2004.<\/li>\n<li>I. Cohen and S. Zhang and M. Goldszmidt and J. Symons and T. Kelly and A. Fox,\u00a0<a href=\"http:\/\/www.ifp.uiuc.edu\/%7Eiracohen\/publications\/Cohen_etalSOSP2005.pdf\">Capturing, indexing, clustering, and retrieving system history<\/a>, Proc. of SOSP, 2005.<\/li>\n<li>\u00a0<\/li>\n<\/ul>\n<\/td>\n<td rowspan=\"2\" width=\"24%\"><strong><b>2\/10 midnight: Project proposal due<\/b><\/strong><\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">2\/6<\/td>\n<td width=\"15%\">\n<p>Overlay Networks [<a href=\"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-content\/uploads\/sites\/77\/2025\/02\/lecture-overlay.ppt\">slides<\/a>]<\/p>\n<\/td>\n<td width=\"50%\">\n<ul>\n<li>D. Andersen and H. Balakrishnan and F. 
Kaashoek and R. Morris,<a href=\"http:\/\/nms.lcs.mit.edu\/papers\/ron-sosp2001.pdf\">\u00a0Resilient Overlay Networks<\/a>, Proc. 18th ACM SOSP, 2001.<\/li>\n<li>Y. Chu and S. G. Rao and S. Seshan and H. Zhang,<a href=\"http:\/\/esm.cs.cmu.edu\/technology\/papers\/Sigmetrics.CaseForESM.2000.pdf\">\u00a0A Case For End System Multicast<\/a>, IEEE Journal on Selected Areas in Communications (JSAC), Special Issue on Networking Support for Multicast, 2002.<\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td rowspan=\"2\" width=\"1%\">6<\/td>\n<td width=\"5%\">2\/11<\/td>\n<td width=\"15%\">Wellness Day (No classes)<\/td>\n<td width=\"50%\">\u00a0<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">2\/13<\/td>\n<td>\n<p>Peer-to-Peer Systems<\/p>\n<p>[<a href=\"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-content\/uploads\/sites\/77\/2025\/02\/lecture-p2p.ppt\">slides<\/a>]<\/p>\n<\/td>\n<td width=\"15%\">\n<ul>\n<li>Rowstron and P. Druschel, &#8220;<a href=\"http:\/\/freepastry.rice.edu\/PAST\/pastry.pdf\">Pastry: Scalable, distributed object location and routing for large-scale peer-to-peer systems<\/a>&#8220;.\u00a0 Proc. of Middleware 2001.<\/li>\n<li>Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, and Hari Balakrishnan, &#8220;<a href=\"http:\/\/pdos.csail.mit.edu\/papers\/chord:sigcomm01\/chord_sigcomm.pdf\">Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications<\/a>&#8220;, Proc. of SIGCOMM 2001.<\/li>\n<\/ul>\n<ul>\n<li style=\"list-style-type: none\">\u00a0<\/li>\n<\/ul>\n<\/td>\n<td rowspan=\"1\" width=\"24%\">\n<p><strong><b>2\/17 midnight:\u00a0<\/b>Reviews due<\/strong><\/p>\n<ul>\n<li>Daniel Dean, Hiep Nguyen, Xiaohui Gu, Hui Zhang, Junghwan Rhee, Nipun Arora, Geoff Jiang,\u00a0<a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/socc14.pdf\">PerfScope: Practical Online Server Performance Bug Inference in Production Cloud Computing Infrastructures<\/a>, Proc. of SOCC 2014, and<\/li>\n<li>Hiep Nguyen, Daniel J. 
Dean, Kamal Kc, Xiaohui Gu,\u00a0<a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/ATC14.pdf\">Insight: In-situ Online Service Failure Path Inference in Production Computing Infrastructures<\/a>,\u00a0Proc. of USENIX ATC 2014.<\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td rowspan=\"2\" width=\"2%\">7<\/td>\n<td width=\"5%\">2\/18<\/td>\n<td width=\"15%\">Big Data [<a href=\"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-content\/uploads\/sites\/77\/2025\/02\/lecture-dic.ppt\">slides<\/a>]<\/td>\n<td width=\"50%\">\n<ul>\n<li>Jeffrey Dean and Sanjay Ghemawat,\u00a0<a href=\"http:\/\/www.usenix.org\/events\/osdi04\/tech\/full_papers\/dean\/dean.pdf\">MapReduce: Simplified Data Processing on Large Clusters<\/a>. Proc. of OSDI 2004.<\/li>\n<li>Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, and Robert E. Gruber,<a href=\"http:\/\/labs.google.com\/papers\/bigtable-osdi06.pdf\">\u00a0Bigtable: A Distributed Storage System for Structured Data<\/a>, OSDI 2006.<\/li>\n<li>Bu\u011fra Gedik, Henrique Andrade, Kun-Lung Wu, Philip S. Yu, and MyungCheol Doo.\u00a0<a href=\"http:\/\/portal.acm.org\/citation.cfm?doid=1376616.1376729\">SPADE: The System S Declarative Stream Processing Engine<\/a>. International Conference on Management of Data, ACM SIGMOD, 2008.<\/li>\n<\/ul>\n<\/td>\n<td rowspan=\"2\" width=\"24%\"><strong><b>2\/24 midnight:\u00a0<\/b>Reviews due<\/strong>\n<ul>\n<li>T. Xu, et al.,\u00a0<a href=\"http:\/\/delivery.acm.org\/10.1145\/2530000\/2522727\/p244-xu.pdf?ip=74.72.67.12&amp;id=2522727&amp;acc=OA&amp;key=24B49002E011608CF08962C63678233A&amp;CFID=390234581&amp;CFTOKEN=43836950&amp;__acm__=1387576623_690938a3fdf7c2a8887cae554ccd6d2f\">Do Not Blame Users for Misconfigurations<\/a>, Proc. 
of ACM SOSP 2013.<\/li>\n<li>Mona Attariyan, Michael Chow, Jason Flinn, &#8220;<a href=\"https:\/\/www.usenix.org\/system\/files\/conference\/osdi12\/osdi12-final-33.pdf\">X-ray: Automating Root-Cause Diagnosis of Performance Anomalies in Production Software<\/a>&#8220;, Proc. of OSDI 2012.\u00a0<\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">2\/20<\/td>\n<td width=\"15%\">System Research Methodology [<a href=\"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-content\/uploads\/sites\/77\/2025\/02\/lecture-methodology.ppt\">slides<\/a>]<\/td>\n<td width=\"50%\">\n<p>&nbsp;<\/p>\n<ul>\n<li>Roy Levin and David D. Redell,\u00a0<a href=\"http:\/\/www.cs.umbc.edu\/cra\/etw98\/writing-papers.pdf\">An Evaluation of the Ninth SOSP Submissions; or, How (and How Not) to Write a Good Systems Paper<\/a>, ACM SIGOPS Operating Systems Review, Vol. 17, No. 3 (July, 1983), pages 35-40.<\/li>\n<li><a href=\"http:\/\/people.csail.mit.edu\/mernst\/advice\/write-technical-paper.html\">Writing a technical paper by Michael Ernst<\/a><\/li>\n<\/ul>\n<ul>\n<li style=\"list-style-type: none\">\u00a0<\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td rowspan=\"2\" width=\"2%\">8<\/td>\n<td width=\"5%\">2\/25<\/td>\n<td width=\"15%\">Project Proposal Presentation<\/td>\n<td width=\"50%\">\n<ol>\n<li><strong>Atomic Transactions in Distributed Key Value Store &#8211;<\/strong> Sachin R Doddaguni, Samarth Mahesh Shetty<\/li>\n<li><strong>Autonomous Agentic RAG with Distributed Vector Databases for Scalable Information Retrieval &#8211;<\/strong> Tural Mehtiyev, Anirudh Kaluri, Sagar Dama<\/li>\n<li><strong>Distributed Hash Table (DHT) using Chord<\/strong> &#8211; Apurv Choudari, Harikrishnan Venkatesh, Kruthik Jonnagaddala<\/li>\n<li><strong>Improving Chunking Algorithm for BeeGFS &#8211; <\/strong>Aryan Gupta<strong>, <span style=\"font-weight: 400\">Jayesh Bhagyesh Gajbhar, Tanishq Virendrabhai Todkar<\/span><\/strong><\/li>\n<li><strong>Leveraging Self-Supervised 
Hybrid Learning for Container Security in Kubernetes: A Detection and Response Framework &#8211; <span style=\"font-weight: 400\">Sumeet Bapurao Khillare, <span style=\"font-weight: 300\">Shanmukh Pawan Moparthi, Chirag Bheemaiah Palanganda Karumbaiah<\/span><\/span><\/strong><\/li>\n<li><strong>LogPress: Optimized Compression and Retrieval of Unstructured Logs &#8211;\u00a0<\/strong>Neel Dudheliya, Pranav Jibhakate, Tanay Gandhi\u00a0<\/li>\n<li><strong>SPHERE: Scalable Proactive Handling for Efficient Resource Expansion<\/strong> &#8211; Kashika Malick, Rajat Chandak, Shubh Nisar<\/li>\n<\/ol>\n<\/td>\n<td rowspan=\"2\" width=\"24%\"><strong><b>3\/3 midnight: <\/b>Reviews due<\/strong>\n<ul>\n<li>Mart\u00edn Abadi et al.,\u00a0<a href=\"https:\/\/www.usenix.org\/system\/files\/conference\/osdi16\/osdi16-abadi.pdf\">TensorFlow: A System for Large-Scale Machine Learning<\/a>, Proc. of OSDI 2016, and<\/li>\n<li>Hiep Nguyen, Zhiming Shen, Xiaohui Gu, Sethuraman Subbiah, John Wilkes, <a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/icac2013.pdf\">AGILE: elastic distributed resource scaling for Infrastructure-as-a-Service<\/a>, Proc. of ICAC 2013.<\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">2\/27<\/td>\n<td width=\"15%\">\u00a0Student presentation<\/td>\n<td>\n<ul>\n<li>&#8220;<a href=\"https:\/\/dance.csc.ncsu.edu\/papers\/ACSOS24.pdf\">ClearCausal: Cross Layer Causal Analysis for Automatic Microservice Performance Debugging<\/a>&#8220;,Olufogorehan Tunde-Onadele, Feiran Qin, Xiaohui Gu, Yuhang Lin, 5th IEEE International Conference on Autonomic Computing and Self-Organizing Systems &#8211;\u00a0<strong>Apurv Choudhari (apchoudh)<\/strong><\/li>\n<li>Jingzhu He, Ting Dai, Xiaohui Gu, and Guoliang Jin,\u00a0&#8220;<a href=\"https:\/\/dance.csc.ncsu.edu\/papers\/SOCC20.pdf\">HangFix: Automatically Fixing Software Hang Bugs for Production Cloud Systems<\/a>&#8220;,\u00a0Proc. of ACM Symposium on Cloud Computing (<b>SOCC<\/b>), Renton, WA, October, 2020, pp. 
344-357. &#8211;\u00a0<strong>Tanishq Virendrabhai Todkar (ttodkar)<\/strong><\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td rowspan=\"2\" width=\"2%\">9<\/td>\n<td width=\"5%\">3\/4<\/td>\n<td width=\"15%\">Student presentation<\/td>\n<td width=\"50%\">\n<p>&nbsp;<\/p>\n<ul>\n<li><span style=\"font-weight: 400\">Ting Dai, Jingzhu He, Xiaohui Gu, Shan Lu, and Peipei Wang,\u00a0&#8220;<\/span><a style=\"font-weight: 400\" href=\"http:\/\/dance.csc.ncsu.edu\/papers\/SOCC18.pdf\">\u00a0DScope: Detecting Real-World Data Corruption Hang Bugs in Cloud Server Systems<\/a><span style=\"font-weight: 400\">&#8220;,\u00a0<\/span><span style=\"font-weight: 400\">Proc. of ACM Symposium on Cloud Computing (<\/span><b>SOCC<\/b><span style=\"font-weight: 400\">), Carlsbad, CA, October, 2018. &#8211; <strong>Jayesh Bhagyesh Gajbhar (jgajbha)<\/strong><\/span><\/li>\n<li>Daniel Dean, Hiep Nguyen, Xiaohui Gu, Hui Zhang, Junghwan Rhee, Nipun Arora, Geoff Jiang, <a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/socc14.pdf\">PerfScope: Practical Online Server Performance Bug Inference in Production Cloud Computing Infrastructures<\/a>, Proc. of SOCC 2014. <strong>&#8211; Kashika Malick (kmalick)<\/strong><\/li>\n<\/ul>\n<\/td>\n<td rowspan=\"2\" width=\"24%\">No paper reading assigned. You should spend time on your term projects.<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">3\/6<\/td>\n<td width=\"15%\">Student presentation<\/td>\n<td width=\"50%\">\n<ul>\n<li>Hiep Nguyen, Zhiming Shen, Yongmin Tan, Xiaohui Gu, &#8220;<a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/icdcs2013.pdf\">FChain: Toward Black-box Online Fault Localization for Cloud Systems<\/a>\u201d, Proc. of ICDCS 2013. &#8211; <strong>Shubh Nisar (snisar)<\/strong><\/li>\n<li>Daniel Dean, Hiep Nguyen, Xiaohui Gu, &#8220;<a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/UBL.pdf\">UBL: Unsupervised Behavior Learning for Predicting Performance Anomalies in Virtualized Cloud Systems<\/a>\u201d, Proc. 
of ACM International Conference on Autonomic Computing (ICAC), San Jose, CA, September, 2012. <strong>&#8211; Rajat Girish Chandak (rchanda3)<\/strong><\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr style=\"height: 0.55in\">\n<td style=\"padding: 0.75pt;width: 2.74%;height: 0.55in\" rowspan=\"2\" width=\"2%\">\n<p class=\"MsoNormal\">10<\/p>\n<\/td>\n<td style=\"padding: 0.75pt;width: 5.5%;height: 0.55in\" width=\"5%\">\n<p class=\"MsoNormal\" style=\"text-align: center\" align=\"center\">3\/11<\/p>\n<\/td>\n<td style=\"padding: 0.75pt;width: 15.54%;height: 0.55in\" width=\"15%\"><b>Spring Break<\/b><\/td>\n<td style=\"padding: 0.75pt;width: 50.4%;height: 0.55in\" width=\"50%\">\n<ul>\n<li>No Class<\/li>\n<\/ul>\n<\/td>\n<td colspan=\"1\" rowspan=\"2\" valign=\"top\">No paper reading assigned. You should spend time on your<br \/>term projects.<\/td>\n<\/tr>\n<tr style=\"height: 37.2pt\">\n<td style=\"padding: 0.75pt;width: 5.5%;height: 37.2pt\" width=\"5%\">\n<p class=\"MsoNormal\" style=\"text-align: center\" align=\"center\">3\/13<\/p>\n<\/td>\n<td style=\"padding: 0.75pt;width: 15.54%;height: 0.55in\" width=\"15%\"><b>Spring Break<\/b><\/td>\n<td style=\"padding: 0.75pt;width: 50.4%;height: 0.55in\" width=\"50%\">\n<ul>\n<li>No Class<\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td colspan=\"1\" rowspan=\"2\">11<\/td>\n<td>3\/18<\/td>\n<td>Student presentation<\/td>\n<td>\n<ul>\n<li>&#8220;<a class=\"c-link\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/if-at-first-you-dont-succeed-try-try-again-insights-and-llm-informed-tooling-for-detecting-retry-bugs-in-software-systems\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-stringify-link=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/if-at-first-you-dont-succeed-try-try-again-insights-and-llm-informed-tooling-for-detecting-retry-bugs-in-software-systems\/\" data-sk=\"tooltip_parent\">If At First You Don\u2019t Succeed, Try, Try, Again\u2026? 
Insights and LLM-informed Tooling for Detecting Retry Bugs in Software Systems<\/a>&#8220;, Bogdan Alexandru Stoica, Utsav Sethi , Yiming Su , Cyrus Zhou, <a class=\"c-link\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/shanlu\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-stringify-link=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/shanlu\/\" data-sk=\"tooltip_parent\">Shan Lu<\/a>,\u00a0<a class=\"c-link\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/jonathanmace\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-stringify-link=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/jonathanmace\/\" data-sk=\"tooltip_parent\">Jonathan Mace<\/a>,\u00a0<a class=\"c-link\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/madanm\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-stringify-link=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/madanm\/\" data-sk=\"tooltip_parent\">Madan Musuvathi<\/a>,\u00a0<a class=\"c-link\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/sumann\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-stringify-link=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/sumann\/\" data-sk=\"tooltip_parent\">Suman Nath<\/a>, <i data-stringify-type=\"italic\">ACM SOSP 2024<\/i> | November 2024 &#8211; <strong>Tural Mehtiyev (tmehtiy)<\/strong><\/li>\n<li>Mike Chow,\u00a0<a href=\"https:\/\/www.usenix.org\/conference\/osdi24\/presentation\/chow\">ServiceLab: Preventing Tiny Performance Regressions at Hyperscale through Pre-Production Testing<em>,\u00a0<\/em><\/a>Proc. of OSDI 2024 &#8211; <strong>Harikrishnan Venkatesh (hvenkat2)<\/strong><\/li>\n<\/ul>\n<\/td>\n<td colspan=\"1\" rowspan=\"2\">No paper reading assigned. 
You should spend time on your term projects.<\/td>\n<\/tr>\n<tr>\n<td>3\/20<\/td>\n<td>\u00a0Project MidReview<\/td>\n<td>\n<ol>\n<li><strong>Atomic Transactions in Distributed Key Value Store &#8211;<\/strong> Sachin R Doddaguni, Samarth Mahesh Shetty<\/li>\n<li><strong>AGILE: Elastic Distributed Resource Scaling for Infrastructure-as-a-Service<\/strong> &#8211; Anirudh Kaluri, Sagar Dama<\/li>\n<li><b>Unsupervised Model Drift Detection in Federated Learning<\/b> &#8211; Tural Mehtiyev<\/li>\n<li><strong>Distributed Hash Table (DHT) using Chord<\/strong> &#8211; Apurv Choudari, Harikrishnan Venkatesh, Kruthik Jonnagaddala<\/li>\n<\/ol>\n<\/td>\n<\/tr>\n<tr>\n<td colspan=\"1\" rowspan=\"2\">12<\/td>\n<td width=\"5%\">3\/25<\/td>\n<td width=\"15%\">Project MidReview<\/td>\n<td>\n<ol>\n<li><strong>Improving Chunking Algorithm for BeeGFS &#8211; <\/strong>Aryan Gupta<strong>, <span style=\"font-weight: 400\">Jayesh Bhagyesh Gajbhar, Tanishq Virendrabhai Todkar<\/span><\/strong><\/li>\n<li><strong>Leveraging Self-Supervised Hybrid Learning for Container Security in Kubernetes: A Detection and Response Framework &#8211; <span style=\"font-weight: 400\">Sumeet Bapurao Khillare, <span style=\"font-weight: 300\">Shanmukh Pawan Moparthi, Chirag Bheemaiah Palanganda Karumbaiah<\/span><\/span><\/strong><\/li>\n<li><strong>LogPress: Optimized Compression and Retrieval of Unstructured Logs &#8211;\u00a0<\/strong>Neel Dudheliya, Pranav Jibhakate, Tanay Gandhi\u00a0<\/li>\n<li><strong>SPHERE: Scalable Proactive Handling for Efficient Resource Expansion<\/strong> &#8211; Kashika Malick, Rajat Chandak, Shubh Nisar<\/li>\n<\/ol>\n<\/td>\n<td colspan=\"1\" rowspan=\"2\">No paper reading assigned. 
You should spend time on your term projects.<\/td>\n<\/tr>\n<tr>\n<td>3\/27<\/td>\n<td>Student presentation<\/td>\n<td>\n<ul>\n<li><a href=\"https:\/\/www.usenix.org\/conference\/osdi24\/presentation\/wang-rui\">Rui Wang,\u00a0<em>et al.,\u00a0<\/em><\/a><a href=\"https:\/\/www.usenix.org\/conference\/osdi24\/presentation\/wang-rui\">\u03bcSlope: High Compression and Fast Search on Semi-Structured Logs,<em>\u00a0Proc. of OSDI 2024<\/em><\/a> &#8211; <strong>Anirudh Kaluri (akaluri)<\/strong><\/li>\n<li><a href=\"https:\/\/www.usenix.org\/conference\/osdi24\/presentation\/wang-rui\">Y. Sheng et al., Fairness in Serving Large Language Models<em>, Proc. of OSDI 2024<\/em><\/a> &#8211; <strong>Samarth Shetty (sshett22)<\/strong><\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td colspan=\"1\" rowspan=\"2\">13<\/td>\n<td>4\/1<\/td>\n<td>Student presentation<\/td>\n<td>\n<ul>\n<li><a href=\"https:\/\/www.usenix.org\/conference\/osdi24\/presentation\/wang-rui\">Philipp Moritz et al., Ray: A Distributed Framework for Emerging AI Applications<em>, Proc. of OSDI 2018<\/em><\/a> &#8211; <strong>Tanay Gandhi (tgandhi)<\/strong><\/li>\n<li><span style=\"font-weight: 300\">Mart\u00edn Abadi et al., <\/span><a href=\"https:\/\/www.usenix.org\/system\/files\/conference\/osdi16\/osdi16-abadi.pdf\">TensorFlow: A System for Large-Scale Machine Learning, <\/a><span style=\"font-weight: 300\">Proc. of OSDI 2016. &#8211; <strong>Chirag Bheemaiah Palanganda Karumbaiah (cpalang)<\/strong><\/span><\/li>\n<\/ul>\n<\/td>\n<td colspan=\"1\" rowspan=\"2\">No paper reading assigned. You should spend time on your term projects.<\/td>\n<\/tr>\n<tr>\n<td>4\/3<\/td>\n<td>Student presentation<\/td>\n<td>\n<ul>\n<li>Hiep Nguyen, Zhiming Shen, Xiaohui Gu, Sethuraman Subbiah, John Wilkes,&#8221;<a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/icac2013.pdf\">AGILE: elastic distributed resource scaling for Infrastructure-as-a-Service<\/a>&#8220;, Proc. 
of USENIX International Conference on Autonomic Computing (ICAC), San Jose, CA, June, 2013. &#8211; <strong>Sachin R Doddaguni (srdodda)<\/strong><\/li>\n<li>Zhiming Shen, Sethuraman Subbiah, Xiaohui Gu, and John Wilkes, <a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/socc11-cloudscale.pdf\">CloudScale: Elastic Resource Scaling for Multi-Tenant Cloud Systems<\/a>, Proc. of ACM SOCC 2011. &#8211; <strong>Kruthik Jonnagaddala Thyagaraja (kjonnag)<\/strong><\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td colspan=\"1\" rowspan=\"2\">14<\/td>\n<td>4\/8<\/td>\n<td>Student presentation<\/td>\n<td>\n<ul>\n<li>Xiaohui Gu, Klara Nahrstedt, <a href=\"https:\/\/dance.csc.ncsu.edu\/gu\/publications-files\/hpdc02.pdf\"><span class=\"s2\">A Scalable QoS-Aware Service Aggregation Model for Peer-to-Peer Computing Grids<\/span><\/a>, Proc. of IEEE International Symposium on High Performance Distributed Computing (HPDC 2002) &#8211; <strong>Aryan Gupta (agupta72)<\/strong><\/li>\n<li><span style=\"font-weight: 400\">Olufogorehan Tunde-Onadele, Yuhang Lin, Xiaohui Gu, and Jingzhu He,\u00a0&#8220;<\/span><a style=\"font-weight: 400\" href=\"https:\/\/dance.csc.ncsu.edu\/papers\/IC2E22.pdf\">Understanding Software Security Vulnerabilities in Cloud Server Systems<\/a><span style=\"font-weight: 400\">&#8220;,<\/span><span style=\"font-weight: 400\">\u00a0Proc. of the 10th IEEE International Conference on Cloud Engineering (<\/span><b>IC2E<\/b><span style=\"font-weight: 400\">), Pacific Grove, CA, September, 2022 &#8211; <strong>Pranav Arvind Jibhakate (pjibhak)<\/strong><\/span><\/li>\n<\/ul>\n<\/td>\n<td colspan=\"1\" rowspan=\"2\">No paper reading assigned. 
You should spend time on your term projects.<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">4\/10<\/td>\n<td>Student presentation<\/td>\n<td width=\"15%\">\n<ul>\n<li><span style=\"font-weight: 400\"><a style=\"font-weight: 400\" href=\"https:\/\/dance.csc.ncsu.edu\/papers\/ACSOS22.pdf\">Yuhang Lin, Olufogorehan Tunde-Onadele, Xiaohui Gu, Jingzhu He, and Hugo Latapie,\u00a0SHIL: Self-Supervised Hybrid Learning for Security Attack Detection in Containerized Applications<\/a>&#8220;,<br style=\"font-weight: 400\" \/>Proc. of the 3rd IEEE International Conference on Autonomic Computing and Self-Organizing Systems (<b>ACSOS<\/b>), Los Angeles, CA, September, 2022 &#8211; <strong>Sumeet Bapurao Khillare (skhilla)<\/strong><\/span><\/li>\n<li><span style=\"font-weight: 400\">Yuhang Lin, Olufogorehan Tunde-Onadele, and Xiaohui Gu,&#8221;<a style=\"font-weight: 400\" href=\"https:\/\/dance.csc.ncsu.edu\/papers\/ACSAC20.pdf\">CDL: Classified Distributed Learning for Detecting Security Attacks in Containerized Applications<\/a>&#8220;,\u00a0Proc. of Annual Computer Security Applications Conference (<b>ACSAC<\/b>), Austin, TX, December, 2020. &#8211; <strong>Neel Dudheliya (ndudhel)<\/strong><\/span><\/li>\n<\/ul>\n<\/td>\n<\/tr>\n<tr>\n<td colspan=\"1\" rowspan=\"2\">15<\/td>\n<td>4\/15<\/td>\n<td>Student presentation<\/td>\n<td>\n<ul>\n<li><span style=\"font-weight: 400\">Olufogorehan Tunde-Onadele, Yuhang Lin, Jingzhu He, and Xiaohui Gu,\u00a0&#8220;<\/span><a style=\"font-weight: 400\" href=\"http:\/\/dance.csc.ncsu.edu\/papers\/ACSOS20.pdf\">Self-Patch: Beyond Patch Tuesday for Containerized Applications<\/a><span style=\"font-weight: 400\">&#8220;,<\/span><span style=\"font-weight: 400\">\u00a0Proc. of IEEE International Conference on Autonomic Computing and Self-Organizing Systems (<\/span><b>ACSOS<\/b><span style=\"font-weight: 400\">), Washington, DC, August, 2020, pp. 21-27. 
&#8211; <strong>Sagar Dama (sudama)<\/strong><\/span><\/li>\n<li><span style=\"font-weight: 400\"><span style=\"font-weight: 300\">Rui Shu et al.,\u00a0<a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/codaspy17.pdf\">A Study of Security Vulnerabilities on Docker Hub<\/a>, Proc. of CODASPY 2017 &#8211; <strong>Shanmukh Pawan Moparthi (smopart2)<\/strong><\/span><\/span><\/li>\n<\/ul>\n<\/td>\n<td colspan=\"1\" rowspan=\"2\">No paper reading assigned. You should spend time on your term projects.<\/td>\n<\/tr>\n<tr>\n<td width=\"5%\">4\/17<\/td>\n<td>Project Demo<\/td>\n<td width=\"15%\">\n<p>&nbsp;<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td rowspan=\"2\" width=\"2%\">16<\/td>\n<td width=\"5%\">4\/22<\/td>\n<td width=\"15%\">Project Demo<\/td>\n<td colspan=\"1\" rowspan=\"1\" width=\"50%\">\n<ol class=\"p-rich_text_list p-rich_text_list__ordered p-rich_text_list--nested\" data-stringify-type=\"ordered-list\" data-list-tree=\"true\" data-indent=\"0\" data-border=\"0\">\n<li data-stringify-indent=\"0\" data-stringify-border=\"0\"><b data-stringify-type=\"bold\">Atomic Transactions in Distributed Key Value Store &#8211; <\/b>\u00a0Sachin R Doddaguni, Samarth Mahesh Shetty<\/li>\n<li><b data-stringify-type=\"bold\">AGILE: Elastic Distributed Resource Scaling for Infrastructure-as-a-Service &#8211; <\/b>Anirudh Kaluri, Sagar Dama<\/li>\n<li data-stringify-indent=\"0\" data-stringify-border=\"0\"><b data-stringify-type=\"bold\">Distributed Hash Table (DHT) using Chord &#8211; <\/b>Apurv Choudari, Harikrishnan Venkatesh, Kruthik Jonnagaddala<\/li>\n<li data-stringify-indent=\"0\" data-stringify-border=\"0\"><b data-stringify-type=\"bold\">Improving Chunking Algorithm for BeeGFS &#8211; <\/b>Aryan Gupta, Jayesh Bhagyesh Gajbhar, Tanishq Virendrabhai Todkar<\/li>\n<li data-stringify-indent=\"0\" data-stringify-border=\"0\"><b data-stringify-type=\"bold\">Leveraging Self-Supervised Hybrid Learning for Container Security in Kubernetes: A Detection and Response Framework &#8211; <\/b>Sumeet 
Bapurao Khillare, Shanmukh Pawan Moparthi, Chirag Bheemaiah Palanganda Karumbaiah<\/li>\n<li data-stringify-indent=\"0\" data-stringify-border=\"0\"><b data-stringify-type=\"bold\">LogPress: Optimized Compression and Retrieval of Unstructured Logs &#8211; <\/b>Neel Dudheliya, Pranav Jibhakate, Tanay Gandhi<\/li>\n<li data-stringify-indent=\"0\" data-stringify-border=\"0\"><b data-stringify-type=\"bold\">SPHERE: Scalable Proactive Handling for Efficient Resource Expansion &#8211; <\/b>Kashika Malick, Rajat Chandak, Shubh Nisar<\/li>\n<li><b data-stringify-type=\"bold\">Unsupervised Model Drift Detection in Federated Learning &#8211;<\/b>\u00a0Tural Mehtiyev<\/li>\n<\/ol>\n<\/td>\n<td colspan=\"1\" rowspan=\"3\" width=\"50%\">\n<p style=\"font-weight: 400\"><strong><b>April 29th midnight: <\/b>Final project report due, project source code and document due<\/strong><\/p>\n<p style=\"font-weight: 400\"><strong>Your project source code and document submission should be\u00a0a single zip file. 
The zip file should include your system source code and all dependent packages, the experimental subjects used in the project report, instructions on how to set up and use the system to reproduce the experimental results, and any other documents that help others understand your tool and source code.<\/strong><\/p>\n<\/td>\n<\/tr>\n<tr>\n<td>4\/23<\/td>\n<td>\n<p>Project presentation\u00a0<br \/>9am-1pm <br \/>EBII 3211<\/p>\n<\/td>\n<td>\u00a0<\/td>\n<\/tr>\n<tr>\n<td>17<\/td>\n<td>\u00a0<\/td>\n<td>\u00a0<\/td>\n<td>\u00a0<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3>Suggested Topics for Student Presentations<\/h3>\n<p>(You may also suggest to the instructor papers that are not on this list but that you would like to present):<\/p>\n<p>Please check below for your assigned paper.<\/p>\n<div>\n<h4>AI-Driven Distributed System Management<\/h4>\n<\/div>\n<ol>\n<li><span style=\"font-weight: 400\">&#8220;<\/span><a style=\"font-weight: 400\" href=\"https:\/\/dance.csc.ncsu.edu\/papers\/ACSOS24.pdf\">ClearCausal: Cross Layer Causal Analysis for Automatic Microservice Performance Debugging<\/a><span style=\"font-weight: 400\">&#8220;,<\/span><span style=\"font-weight: 400\">Olufogorehan Tunde-Onadele, Feiran Qin, Xiaohui Gu, Yuhang Lin<\/span><span style=\"font-weight: 400\">, 5th IEEE International Conference on Autonomic Computing and Self-Organizing Systems &#8211; <strong>Apurv Choudhari (apchoudh)<\/strong><\/span><\/li>\n<li><span style=\"font-weight: 400\"><span style=\"font-weight: 400\">Jingzhu He, Ting Dai, Xiaohui Gu, and Guoliang Jin,\u00a0<\/span>&#8220;<\/span><a style=\"font-weight: 400\" href=\"https:\/\/dance.csc.ncsu.edu\/papers\/SOCC20.pdf\">HangFix: Automatically Fixing Software Hang Bugs for Production Cloud Systems<\/a><span style=\"font-weight: 400\">&#8220;,<\/span><span style=\"font-weight: 400\">\u00a0Proc. of ACM Symposium on Cloud Computing (<\/span><b>SOCC<\/b><span style=\"font-weight: 400\">), Renton, WA, October, 2020, pp. 344-357. 
&#8211; <strong>Tanishq Virendrabhai Todkar (ttodkar)<\/strong><\/span><\/li>\n<li><span style=\"font-weight: 400\"><span style=\"font-weight: 400\">Ting Dai, Jingzhu He, Xiaohui Gu, Shan Lu, and Peipei Wang,\u00a0<\/span>&#8220;<\/span><a style=\"font-weight: 400\" href=\"http:\/\/dance.csc.ncsu.edu\/papers\/SOCC18.pdf\">\u00a0DScope: Detecting Real-World Data Corruption Hang Bugs in Cloud Server Systems<\/a><span style=\"font-weight: 400\">&#8220;,\u00a0<\/span><span style=\"font-weight: 400\">Proc. of ACM Symposium on Cloud Computing (<\/span><b>SOCC<\/b><span style=\"font-weight: 400\">), Carlsbad, CA, October, 2018. &#8211; <strong>Jayesh Bhagyesh Gajbhar (jgajbha)<\/strong><\/span><\/li>\n<li>Daniel Dean, Hiep Nguyen, Xiaohui Gu, Hui Zhang, Junghwan Rhee, Nipun Arora, Geoff Jiang, <a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/socc14.pdf\">PerfScope: Practical Online Server Performance Bug Inference in Production Cloud Computing Infrastructures<\/a>&#8220;, Proc. of SOCC 2014. <strong>&#8211; Kashika Malick (kmalick)<\/strong><\/li>\n<li>Hiep Nguyen, Zhiming Shen, Yongmin Tan, Xiaohui Gu,&#8221;<a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/icdcs2013.pdf\">FChain: Toward Black-box Online Fault Localization for Cloud Systems<\/a>\u201d, Proc. of ICDCS 2013. &#8211; <strong>Shubh Nisar (snisar)<\/strong><\/li>\n<li>Daniel Dean, Hiep Nguyen, Xiaohui Gu, &#8220;<a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/UBL.pdf\">UBL: Unsupervised Behavior Learning for Predicting Performance Anomalies in Virtualized Cloud Systems<\/a>\u201d, Proc. of ACM International Conference on Autonomic Computing (ICAC), San Jose, CA, September, 2012. 
<strong>&#8211; Rajat Girish Chandak (rchanda3)<\/strong><\/li>\n<li>&#8220;<a class=\"c-link\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/if-at-first-you-dont-succeed-try-try-again-insights-and-llm-informed-tooling-for-detecting-retry-bugs-in-software-systems\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-stringify-link=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/if-at-first-you-dont-succeed-try-try-again-insights-and-llm-informed-tooling-for-detecting-retry-bugs-in-software-systems\/\" data-sk=\"tooltip_parent\">If At First You Don\u2019t Succeed, Try, Try, Again\u2026? Insights and LLM-informed Tooling for Detecting Retry Bugs in Software Systems<\/a>&#8220;, Bogdan Alexandru Stoica, Utsav Sethi , Yiming Su , Cyrus Zhou, <a class=\"c-link\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/shanlu\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-stringify-link=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/shanlu\/\" data-sk=\"tooltip_parent\">Shan Lu<\/a>,\u00a0<a class=\"c-link\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/jonathanmace\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-stringify-link=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/jonathanmace\/\" data-sk=\"tooltip_parent\">Jonathan Mace<\/a>,\u00a0<a class=\"c-link\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/madanm\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-stringify-link=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/madanm\/\" data-sk=\"tooltip_parent\">Madan Musuvathi<\/a>,\u00a0<a class=\"c-link\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/sumann\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-stringify-link=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/sumann\/\" data-sk=\"tooltip_parent\">Suman Nath<\/a>, <i data-stringify-type=\"italic\">ACM SOSP 2024<\/i> | November 2024 &#8211; <strong>Tural Mehtiyev 
(tmehtiy)<\/strong><\/li>\n<\/ol>\n<h4>Cloud Computing &amp; Data Center &amp; Big Data<\/h4>\n<ol>\n<li>\n<p>Mike Chow,\u00a0<a href=\"https:\/\/www.usenix.org\/conference\/osdi24\/presentation\/chow\">ServiceLab: Preventing Tiny Performance Regressions at Hyperscale through Pre-Production Testing<em>,\u00a0<\/em><\/a>Proc. of OSDI 2024 &#8211; <strong>Harikrishnan Venkatesh (hvenkat2)<\/strong><\/p>\n<\/li>\n<li><a href=\"https:\/\/www.usenix.org\/conference\/osdi24\/presentation\/wang-rui\">Rui Wang,\u00a0<em>et al.,\u00a0<\/em><\/a><a href=\"https:\/\/www.usenix.org\/conference\/osdi24\/presentation\/wang-rui\">\u03bcSlope: High Compression and Fast Search on Semi-Structured Logs,<em>\u00a0Proc. of OSDI 2024<\/em><\/a> &#8211; <strong>Anirudh Kaluri (akaluri)<\/strong><\/li>\n<li><a href=\"https:\/\/www.usenix.org\/conference\/osdi24\/presentation\/wang-rui\">Y. Sheng et al., Fairness in Serving Large Language Models<em>, Proc. of OSDI 2024<\/em><\/a> &#8211; <strong>Samarth Shetty (sshett22)<\/strong><\/li>\n<li><a href=\"https:\/\/www.usenix.org\/conference\/osdi24\/presentation\/wang-rui\">Philipp Moritz et al., Ray: A Distributed Framework for Emerging AI Applications<em>, Proc. of OSDI 2018<\/em><\/a> &#8211; <strong>Tanay Gandhi (tgandhi)<\/strong><\/li>\n<li><span style=\"font-weight: 300\">Mart\u00edn Abadi et al., <\/span><a href=\"https:\/\/www.usenix.org\/system\/files\/conference\/osdi16\/osdi16-abadi.pdf\">TensorFlow: A System for Large-Scale Machine Learning, <\/a><span style=\"font-weight: 300\"><span style=\"font-weight: 300\">Proc. of OSDI 2016. 
&#8211; <strong>Chirag Bheemaiah Palanganda Karumbaiah (cpalang)<\/strong><\/span><\/span><\/li>\n<li>Hiep Nguyen, Zhiming Shen, Xiaohui Gu, Sethuraman Subbiah, John Wilkes,&#8221;<a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/icac2013.pdf\">AGILE: elastic distributed resource scaling for Infrastructure-as-a-Service<\/a>&#8220;, Proc. of USENIX International Conference on Autonomic Computing (ICAC), San Jose, CA, June, 2013. &#8211; <strong>Sachin R Doddaguni (srdodda)<\/strong><\/li>\n<li>Zhiming Shen, Sethuraman Subbiah, Xiaohui Gu, and John Wilkes, <a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/socc11-cloudscale.pdf\">CloudScale: Elastic Resource Scaling for Multi-Tenant Cloud Systems<\/a>, Proc. of ACM SOCC 2011. &#8211; <strong>Kruthik Jonnagaddala Thyagaraja (kjonnag)<\/strong><\/li>\n<li>Xiaohui Gu, Klara Nahrstedt, <a href=\"https:\/\/dance.csc.ncsu.edu\/gu\/publications-files\/hpdc02.pdf\"><span class=\"s2\">A Scalable QoS-Aware Service Aggregation Model for Peer-to-Peer Computing Grids<\/span><\/a>, Proc. of IEEE International Symposium on High Performance Distributed Computing (HPDC 2002) &#8211; <strong>Aryan Gupta (agupta72)<\/strong><\/li>\n<\/ol>\n<h4>Distributed Systems Security<\/h4>\n<ol>\n<li><span style=\"font-weight: 400\"><span style=\"font-weight: 400\">Olufogorehan Tunde-Onadele, Yuhang Lin, Xiaohui Gu, and Jingzhu He,\u00a0<\/span>&#8220;<\/span><a style=\"font-weight: 400\" href=\"https:\/\/dance.csc.ncsu.edu\/papers\/IC2E22.pdf\">Understanding Software Security Vulnerabilities in Cloud Server Systems<\/a><span style=\"font-weight: 400\">&#8220;,<\/span><span style=\"font-weight: 400\">\u00a0Proc. 
of the 10th IEEE International Conference on Cloud Engineering (<\/span><b>IC2E<\/b><span style=\"font-weight: 400\">), Pacific Grove, CA, September, 2022 &#8211; <strong>Pranav Arvind Jibhakate (pjibhak)<\/strong><\/span><\/li>\n<li><span style=\"font-weight: 400\"><a style=\"font-weight: 400\" href=\"https:\/\/dance.csc.ncsu.edu\/papers\/ACSOS22.pdf\"><span style=\"font-weight: 400\">Yuhang Lin, Olufogorehan Tunde-Onadele, Xiaohui Gu, Jingzhu He, and Hugo Latapie,\u00a0<\/span>SHIL: Self-Supervised Hybrid Learning for Security Attack Detection in Containerized Applications<\/a><span style=\"font-weight: 400\">&#8220;,<\/span><br style=\"font-weight: 400\" \/><span style=\"font-weight: 400\">Proc. of the 3rd IEEE International Conference on Autonomic Computing and Self-Organizing Systems (<\/span><b>ACSOS<\/b><span style=\"font-weight: 400\">), Los Angeles, CA, September, 2022 &#8211; <strong>Sumeet Bapurao Khillare (skhilla)<\/strong><\/span><\/span><\/li>\n<li><span style=\"font-weight: 400\"><span style=\"font-weight: 400\"><span style=\"font-weight: 400\"><span style=\"font-weight: 400\">Yuhang Lin, Olufogorehan Tunde-Onadele, and Xiaohui Gu<\/span><span style=\"font-weight: 400\">,<\/span>&#8220;<\/span><a style=\"font-weight: 400\" href=\"https:\/\/dance.csc.ncsu.edu\/papers\/ACSAC20.pdf\">CDL: Classified Distributed Learning for Detecting Security Attacks in Containerized Applications<\/a><span style=\"font-weight: 400\">&#8220;,<\/span><span style=\"font-weight: 400\">\u00a0Proc. of Annual Computer Security Applications Conference (<\/span><b>ACSAC<\/b><span style=\"font-weight: 400\">), Austin, TX, December, 2020. 
&#8211; <strong>Neel Dudheliya (ndudhel)<\/strong><\/span><\/span><\/span><\/li>\n<li><span style=\"font-weight: 400\"><span style=\"font-weight: 400\">Olufogorehan Tunde-Onadele, Yuhang Lin, Jingzhu He, and Xiaohui Gu,\u00a0<\/span>&#8220;<\/span><a style=\"font-weight: 400\" href=\"http:\/\/dance.csc.ncsu.edu\/papers\/ACSOS20.pdf\">Self-Patch: Beyond Patch Tuesday for Containerized Applications<\/a><span style=\"font-weight: 400\">&#8220;,<\/span><span style=\"font-weight: 400\">\u00a0Proc. of IEEE International Conference on Autonomic Computing and Self-Organizing Systems (<\/span><b>ACSOS<\/b><span style=\"font-weight: 400\">), Washington, DC, August, 2020, pp. 21-27. &#8211; <strong>Sagar Dama (sudama)<\/strong><\/span><\/li>\n<li><span style=\"font-weight: 400\"><span style=\"font-weight: 300\">Rui Shu et al.,\u00a0<a href=\"http:\/\/dance.csc.ncsu.edu\/papers\/codaspy17.pdf\">A Study of Security Vulnerabilities on Docker Hub<\/a>, Proc. of CODASPY 2017 &#8211; <strong>Shanmukh Pawan Moparthi (smopart2)<\/strong><\/span><\/span><\/li>\n<\/ol>\n<h3>Academic Integrity<\/h3>\n<p style=\"font-weight: 400\">The university provides a detailed policy on academic integrity. This policy can be found in the\u00a0<a href=\"http:\/\/www.ncsu.edu\/policies\/student_services\/student_discipline\/POL11.35.1.php\">Code of Student Conduct<\/a>. It is understood that when you submit your homework, you are implicitly agreeing to the university honor pledge: &#8220;I have neither given nor received unauthorized aid on this test or assignment.&#8221;<\/p>\n<p style=\"font-weight: 400\">Academic dishonesty (e.g., cheating or plagiarism) will not be tolerated under any circumstances. If you are having difficulty with any part of the course material, please see me as soon as possible. I will do everything I can to help you with any course-related problems you may be having. 
If you are found to be guilty of academic dishonesty, however, I will then do everything I can to see that you are punished as forcefully as possible. This may include asking to have you suspended or expelled from the course, the program, and\/or the university. At a minimum, you will receive -50% for the assignment in question, and your name will be placed on record with the university as having committed an academic offence. Multiple offences during your academic career will result in suspension or expulsion from the university. I take absolutely no pleasure in pursuing cases of academic misconduct, and would ask that you please do not put me in this position.<\/p>\n<h3>Students With Disabilities<\/h3>\n<p style=\"font-weight: 400\">All effort will be made to ensure that no students with disabilities are denied any opportunity to successfully complete this course. If you have specific requirements that need to be addressed, please contact me immediately. Possible changes can include (but are not necessarily limited to) rescheduling classes from inaccessible to accessible buildings, or providing access to auxiliary aids such as tape recorders, special lab equipment, or other services such as readers, note takers, or interpreters. 
This may also include oral or taped tests, readers, scribes, separate testing rooms, or extension of time limits.<\/p>\n<h3>Lab Safety Issues<\/h3>\n<p style=\"font-weight: 400\">None.<\/p>\n<h3>Pass-Through Costs<\/h3>\n<p style=\"font-weight: 400\">None.<\/p>\n\n","protected":false},"excerpt":{"rendered":"<p>Spring 2025Credits:\u00a03Meeting Times: Tuesday\/Thursday, 3:00pm &#8211; 4:15pmMeeting Location: 1212 Engineering Building IIAssignment submission: MoodleMessage board: Piazza \u00a0 Instructor Information Xiaohui (Helen) Gu Office Hours: Tues\/Thurs 3:00pm &#8211; 4:15pm\u00a0 at EBII \u00a03274 Email : xgu AT ncsu.edu Teaching Assistants\/Graders Tanvin Kalra (Grader) Email : tkalra AT ncsu.edu &nbsp; Course Objectives This course explores design and implementation [&hellip;]<\/p>\n","protected":false},"author":3482,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"footnotes":""},"acf":[],"_links":{"self":[{"href":"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-json\/wp\/v2\/pages\/10"}],"collection":[{"href":"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-json\/wp\/v2\/users\/3482"}],"replies":[{"embeddable":true,"href":"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-json\/wp\/v2\/comments?post=10"}],"version-history":[{"count":67,"href":"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-json\/wp\/v2\/pages\/10\/revisions"}],"predecessor-version":[{"id":584,"href":"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-json\/wp\/v2\/pages\/10\/revisions\/584"}],"wp:attachment"
:[{"href":"https:\/\/wordpress-courses2425.wolfware.ncsu.edu\/csc-724-001-sprg-2025\/wp-json\/wp\/v2\/media?parent=10"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}