Syndicate: Democratizing Cloud Storage and Caching through Service Composition Jude Nelson Princeton University

The cloud is changing the way we share data. We can keep data on local workstations and file servers for quick access, but face the challenge of sharing it with a large number of people. Alternatively, we can put our data into one or more cloud storage systems to share it with many other users, but then we cannot access it as quickly or as cheaply. Moreover, local copies of data can get out of sync with cloud copies, causing remote users to see old versions. Our solution is Syndicate, a virtual cloud storage system that composes local storage, cloud storage, and commodity CDNs and network caches to transparently give users the best of both worlds. Syndicate organizes a collection of data spread across multiple clouds and end-user computers into a Volume. Using Syndicate, a user sees and interacts with all data in a Volume as if it were a set of files and directories on local storage. All the while, Syndicate lets a scalable number of users read and write the data, ensures that they see a consistent view of the data, and keeps the data durable by uploading it to existing cloud storage providers. To use Syndicate, a user installs a Syndicate Gateway (SG) on their local workstation or cluster. The SG presents all the files in a Volume (Figure 1), while serving locally-written data to a scalable number of SGs via existing, unmodified network caches and CDNs. They coordinate via a scalable Metadata Service (MS) to help them discover data, mask failures, and read fresh data from caches, even if some caches serve stale data. Additional SGs can be configured to archive a Volume’s data to one or more cloud storage services; Ama-

c 2013 by the Association for Computing Machinery, Inc. Copyright (ACM). Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author. SoCC’13, 1–3 Oct. 2013, Santa Clara, California, USA. ACM 978-1-4503-2428-1. http://dx.doi.org/10.1145/2523616.2525954

Larry Peterson Princeton University

Shared# Volume#

SG#

SG#

SG#

Metadata# Service# Caches'+'Request'Routers' (CDN)'

SG#

S3#

SG# SG# Drop Box#

Exis6ng' Data'Set'

Figure 1: Syndicate components run on top of existing network caches and cloud storage systems, making far-flung data appear as a locally-mounted Volume.

zon S3 and Glacier [1], DropBox [4], Box.net [2], and Google Drive [6] are currently supported. Whenever a user writes data, cloud-facing SGs receive and upload it to their corresponding cloud storage providers. Then, if later one or more of the user-facing SGs fails, Syndicate is still able to read the data from one of these cloudfacing SGs. Sometimes users need to work with an existing, remotely-hosted dataset, such as GenBank [5], CommonCrawl [3], M-Lab measurements [7], and so on. To allow this, an SG may also be configured to expose an existing dataset within a Volume as a read-only directory hierarchy. Syndicate downloads dataset records on-demand via these SGs, thereby transparently leveraging existing caches to scale read delivery. By deploying the appropriate SGs, developers can create their own virtual storage systems layered “on top” of existing providers, letting them meet cost, performance, consistency, and durability requirements independent of the underlying implementations.

References [1] Overview of Amazon Web Services. https: //d36cz9buwru1tt.cloudfront.net/ AWS_Overview.pdf. [2] Box.net. http://www.box.net/. [3] Common Crawl. com/.

https://drive.google.

[4] Dropbox. http://www.dropbox.com/. [5] GenBank. http://www.ncbi.nlm.nih. gov/genbank/. [6] Google Drive. measurementlab.net/.

http://www.

[7] Measurement Lab. measurementlab.net/.

http://www.

Syndicate: Democratizing Cloud Storage and Caching ...

remotely-hosted dataset, such as GenBank [5], Common-. Crawl [3], M-Lab measurements [7], and so on. To allow this, an SG may also be configured to expose an existing dataset within a Volume as a read-only directory hier- archy. Syndicate downloads dataset records on-demand via these SGs, thereby transparently ...

330KB Sizes 0 Downloads 149 Views

Recommend Documents

Google Cloud Storage Cloud Platform
Store application data Google Cloud Storage provides fast access to application data, such as images for a photo editing app. • Share data with colleagues and ...

Storage Architecture and Challenges Cloud Platform
Jul 29, 2010 - A typical warehouse-scale computer: 10,000+ ... 2-10%: disk drive annualized failure rate .... Roger wants to see data on laptop from phone.

Bankshot: Caching Slow Storage in Fast Non-Volatile ...
deterministic replacement policy that the cache manager can im- plement purely in software. Even though libBankshot is aware of per process data access pattern, gathering access information from userspace is unsafe. Implementing usage-aware caching p

CLOUD STORAGE FORENSICS.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. CLOUD ...

A Secured Cost-effective Multi-Cloud Storage in Cloud Computing ...
service business model known as cloud computing. Cloud data storage redefines the security issues targeted on customer's outsourced data (data that is not ...

Secure overlay cloud storage with access control and ...
We design and implement FADE, a secure overlay cloud storage system that achieves ... a more fine-grained approach called policy based file assured deletion, ...

Arrow Syndicate
Telephone 886.2.2181.8888 ¡ Facsim ile 886.2.8501.1691. Hong Kong ... All inform ation hereunder does not constitute a solicitation to buy or sell any securities ...

Enabling Data Storage Security in Cloud Computing for ... - wseas.us
important aspect of quality of service, Cloud. Computing inevitably poses ... also proposed distributed protocols [8]-[10] for ensuring storage .... Best practices for managing trust in private clouds ... information they're hosting on behalf of thei

storage as a service in cloud computing pdf
storage as a service in cloud computing pdf. storage as a service in cloud computing pdf. Open. Extract. Open with. Sign In. Main menu.

DRESS Codes for the Storage Cloud: Simple ...
“cloud storage”, use a large number of individually unreliable storage nodes ..... The simulations results indicate the advantages of abandon- ..... Computing, pp.

'Zoolz Business Terabyte Cloud Storage (5 TB ...
#996>Download: 'Zoolz Business Terabyte Cloud Storage (5 TB) - Unlimited. Users/Servers' by Genie9 Cracked Version. What's up, and you are welcome to my ...

Implementing Cloud Storage Metrics to Improve IT ... - Media11
As we migrate to the private cloud, we seek to optimize the efficiency of the SAN environment by increasing storage capacity utilization. Demand for storage ...

Syndicate Memeber list.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Syndicate ...

data integrity proofs in cloud storage pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. data integrity ...

An Approach For Integrity Verification In Multi Cloud Storage ... - IJRIT
using virtual infrastructure management (VIM) , a multi-cloud allows clients to easily ... These tools help cloud providers construct a distributed cloud storage ...

Migrating to Google Cloud Storage From Amazon Glacier White Paper
Jul 23, 2015 - 2. Copy the data to a bucket using the Nearline Storage class ...... [1] http://docs.aws.amazon.com/cli/latest/reference/glacier/initiate-job.html. [2] ...

Implementing Cloud Storage Metrics to Improve IT ... - Media11
Implementing Cloud Storage Metrics to Improve. IT Efficiency and Capacity Management. IT@Intel White Paper. Intel IT. IT Best Practices. Cloud Computing and ...

Privacy Preserving Public Auditing for Secure Cloud Storage Using TPA
the task of allowing a third party auditor (TPA), on behalf of the cloud client, to verify the integrity of the dynamic data stored in the cloud. To securely introduce an ...

2014 (Computing Reviews) Book review - Cloud storage forensics ...
2014 (Computing Reviews) Book review - Cloud storage forensics [David B. Henderson].pdf. 2014 (Computing Reviews) Book review - Cloud storage forensics ...

Book review - Cloud storage forensics [ Mariam Kiran].pdf
Discussing with appropriate use cases, the authors also analyze some open-source ... The book gives great research findings on file contents, software and ...