We have created Zap, a novel system for transparent migration of
legacy and networked applications. Zap provides a thin virtualization
layer on top of the operating system that introduces pods, which are
groups of processes that are provided a consistent, virtualized view
of the system. This decouples processes in pods from dependencies on
the host operating system and other processes on the system. By
integrating Zap virtualization with a checkpoint-restart mechanism,
Zap can migrate a pod of processes as a unit among machines running
independent operating systems without leaving behind any residual
state after migration.
We have implemented a Zap prototype in Linux that supports transparent
migration of unmodified applications without any kernel modifications.
Our Linux Zap system extends the novel checkpoint-restart mechanism
from our earlier work on CRAK, a system that provides
process Checkpoint and Restart As a Kernel Module for Linux. We
demonstrate that our Linux Zap prototype can provide general-purpose
process migration functionality with low overhead. Our experimental
results for migrating pods that run a standard user's X Window System
desktop computing environment and pods that run an Apache web
server show that these kinds of pods can be migrated with subsecond
checkpoint and restart latencies.
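As a rough illustration of the pod idea described above, the following Python sketch models a pod as a table of stable virtual process identifiers that are rebound to new host identifiers after migration. It is a toy model under stated assumptions, not Zap's implementation: the `Pod` class, its `add_process`, `checkpoint`, and `restart` methods, and all PID values are hypothetical, and the sketch tracks only process identifiers.

```python
import json
from dataclasses import dataclass, field


@dataclass
class Pod:
    """Hypothetical pod: maps stable virtual PIDs to host-specific PIDs."""
    name: str
    vpid_to_host: dict = field(default_factory=dict)
    next_vpid: int = 1

    def add_process(self, host_pid: int) -> int:
        """Register a host process and return the virtual PID it will see."""
        vpid = self.next_vpid
        self.next_vpid += 1
        self.vpid_to_host[vpid] = host_pid
        return vpid

    def checkpoint(self) -> str:
        """Serialize only host-independent state (the virtual PIDs)."""
        return json.dumps({
            "name": self.name,
            "vpids": sorted(self.vpid_to_host),
            "next_vpid": self.next_vpid,
        })

    @classmethod
    def restart(cls, image: str, new_host_pids: list) -> "Pod":
        """Rebuild the pod on another host, rebinding each virtual PID."""
        state = json.loads(image)
        if len(new_host_pids) != len(state["vpids"]):
            raise ValueError("need one host PID per checkpointed process")
        pod = cls(state["name"], next_vpid=state["next_vpid"])
        pod.vpid_to_host = dict(zip(state["vpids"], new_host_pids))
        return pod


# Processes inside the pod keep the same virtual PIDs across hosts.
pod = Pod("desktop")
shell = pod.add_process(host_pid=4711)
editor = pod.add_process(host_pid=4712)
image = pod.checkpoint()
migrated = Pod.restart(image, new_host_pids=[902, 903])
```

Because the checkpoint image contains no host PIDs, nothing in it refers back to the source machine, which mirrors the goal of leaving no residual state behind after migration.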
More Information:
- Kernel-based checkpoint and restart, LWN.net, August 2008.
- Stelios Sidiroglou, Oren Laadan, Carlos R. Pérez, Nicolas
Viennot, Jason Nieh, and Angelos D. Keromytis, "ASSURE:
Automatic Software Self-healing Using REscue points", Proceedings
of the 14th International Conference on Architectural Support for
Programming Languages and Operating Systems (ASPLOS 2009),
Washington, DC, March 7-11, 2009.
- Shaya Potter, Jason Nieh, and Matthew Selsky, "Secure
Isolation of Untrusted Legacy Applications", Proceedings of
the Twenty-first Large Installation System Administration Conference
(LISA 2007), Dallas, TX, November 11-16, 2007, pp. 117-130.
- Oren Laadan, Ricardo Baratto, Dan Phung, Shaya Potter, and
Jason Nieh, "DejaView:
A Personal Virtual Computer Recorder", Proceedings of the
Twenty-first ACM Symposium on Operating Systems Principles (SOSP 2007),
Stevenson, WA, October 14-17, 2007, pp. 279-292.
- Oren Laadan and Jason Nieh, "Transparent
Checkpoint-Restart of Multiple Processes on Commodity Operating Systems",
Proceedings of the 2007 USENIX Annual Technical
Conference, Santa Clara, CA, June 17-22, 2007, pp. 323-336.
- Stelios Sidiroglou, Oren Laadan, Angelos D. Keromytis, and
Jason Nieh, "Using
Rescue Points to Navigate Software Recovery (Short Paper)", Proceedings
of the IEEE Symposium on Security and Privacy, Oakland, CA, May
20-23, 2007, pp. 273-280.
- Shaya Potter and Jason Nieh, "Highly
Reliable Mobile Desktop Computing in Your Pocket", Proceedings
of the IEEE Computer Society Signature Conference on Software
Technology and Applications (COMPSAC 2006), Chicago, IL,
September 18-21, 2006, pp. 247-254.
- Shaya Potter and Jason Nieh, "Reducing
Downtime Due to System Maintenance and Upgrades", Proceedings
of the Nineteenth Large Installation System Administration Conference
(LISA 2005), San Diego, CA, December 4-9, 2005, pp. 47-62. (Best
Student Paper Award)
- Shaya Potter and Jason Nieh, "Breaking
the Ties that Bind: Process Isolation and Migration", ;login:,
USENIX Association, 30(6), December 2005, pp. 14-17.
- Oren Laadan, Dan Phung, and Jason Nieh, "Transparent
Checkpoint-Restart of Distributed Applications on Commodity Clusters",
Proceedings of the 2005 IEEE International Conference
on Cluster Computing (Cluster 2005), Boston, MA, September
27-30, 2005.
- Shaya Potter and Jason Nieh, "AutoPod:
Unscheduled System Updates with Zero Data Loss", Abstract in Proceedings
of the Second IEEE International Conference on Autonomic Computing
(ICAC 2005), Seattle, WA, June 13-16, 2005, pp. 367-368.
- Shaya Potter and Jason Nieh, "WebPod:
Persistent Web Browsing Sessions with Pocketable Storage Devices", Proceedings
of the Fourteenth International World Wide Web Conference (WWW 2005),
Chiba, Japan, May 10-14, 2005, pp. 603-612.
- Angelos Stavrou, Angelos D. Keromytis, Jason Nieh, Vishal
Misra, and Dan Rubenstein, "MOVE:
An End-to-End Solution To Network Denial of Service", Proceedings
of the Twelfth Annual Network and Distributed System Security Symposium
(NDSS 2005), San Diego, CA, February 2-4, 2005, pp. 81-96.
- Shaya Potter and Jason Nieh, "WebPod:
Persistent Web Browsing Sessions with Pocketable Storage Devices",
Technical Report CUCS-047-04, Department of Computer Science, Columbia
University, November 2004.
- Gong Su, "MOVE:
Mobility with Persistent Network Connections", Ph.D. Thesis,
Department of Computer Science, Columbia University, October 2004.
- Ricardo Baratto, Shaya Potter, Gong Su, and Jason Nieh, "MobiDesk:
Mobile Virtual Desktop Computing", Proceedings of the Tenth
Annual ACM International Conference on Mobile Computing and Networking
(MobiCom 2004), Philadelphia, PA, September 26-October 1, 2004,
pp. 1-15. (Best Student Paper Award)
- Shaya Potter, Jason Nieh, and Dinesh Subhraveti, "Secure
Isolation and Migration of Untrusted Legacy Applications",
Technical Report CUCS-005-04, Department of Computer Science, Columbia
University, January 2004.
- Angelos D. Keromytis, Janak Parekh, Philip N. Gross, Gail
Kaiser, Vishal Misra, Jason Nieh, Dan Rubenstein, and Sal Stolfo, "A
Holistic Approach to Service Survivability", Proceedings of
the 2003 ACM Workshop on Survivable and Self-Regenerative Systems,
Fairfax, VA, October 31, 2003, pp. 11-22.
- Angelos D. Keromytis, Janak Parekh, Philip N. Gross, Gail
Kaiser, Vishal Misra, Jason Nieh, Dan Rubenstein, and Sal Stolfo, "A
Holistic Approach to Service Survivability", Technical Report
CUCS-021-03, Department of Computer Science, Columbia University, July
2003.
- Steven Osman, Dinesh Subhraveti, Gong Su, and Jason Nieh, "The
Design and Implementation of Zap: A System for Migrating Computing
Environments", Proceedings of the Fifth Symposium on
Operating Systems Design and Implementation (OSDI 2002), Boston,
MA, December 9-11, 2002, pp. 361-376.
- Gong Su and Jason Nieh, "Mobile
Communication with Virtual Network Address Translation", Technical
Report CUCS-003-02, Department of Computer Science, Columbia
University, February 2002.
- Hua Zhong and Jason Nieh, "CRAK:
Linux Checkpoint / Restart As a Kernel Module", Technical Report
CUCS-014-01, Department of Computer Science, Columbia University,
November 2001.
- CRAK source code for the kernel module and some utilities
for Linux 2.2.19, 2.4.4 (beta), and 2.4.4 with socket support (alpha).
- Zap Developers Mailing List. Zap and CRAK developers may
subscribe to discuss Zap and CRAK development and deployment issues.