fbpx
spk-logo-tm-2023
0%
1-888-310-4540 (main) / 1-888-707-6150 (support) info@spkaa.com
Select Page

A Quick Start Guide to Installing Ceph

windchill features best plm software
Written by SPK Blog Post
Published on October 28, 2013

This seems to be the age of “Big Data”. Every sector seems to have a need for it — from biotechs doing genome sequencing, to financial providers mining market data. For many, the ability to store massive amounts of structured or unstructured data is the key to success, and accessing that data quickly is just as important. Traditionally, centralized storage was the go-to solution. You invested in an expensive Storage Area Network, and in return, it provided excellent performance and scalability.

From a small business perspective, a traditional SAN presents several challenges:

  1. Cost – A SAN is typically composed of a storage controller, some disk shelves, and a separate fiber channel network. Then you have some not-so obvious costs like SAN software & licensing (management software, replication software), and HBA costs.
  2. Administrative overhead – Ethernet switching and routing is ubiquitous. Fiber channel on the other hand, requires experience with FC switches, zoning, multipathing, etc. You’d be best suited to hiring a dedicated storage administrator.
  3. Scaling with respect to cost – You invest in the SAN equipment, increase your compute capacity, purchase more SAN equipment, rinse, repeat. As you grow your SAN, how do you plan for upgrades? How do you justify eventual forklifts?

Enter the new era – distributed filesystems. Ok, perhaps this isn’t so new. Google developed their own in house proprietary filesystem years ago, called BigFiles. It was designed to run on commodity servers, be resilient (since it runs on commodity servers), and perform well. No HBAs, no separate fiber infrastructure, no costly SAN. The idea is that as one unit of computing is added, you get an additional spindle or two of IO – so performance is linear as you scale.

Several open source distributed filesystems have gained in popularity recently. One of which I’ll be discussing today is Ceph. Developed as a drop-in replacement for Hadoop’s distributed filesystem, I’ll show you how you can quickly deploy it to serve as your primary storage. And even if you’re not sequencing any genomes, you likely either have; a VMware cluster, an Exchange installation, or users who simply like to store lots of files. Any of these situations can reap the benefits.

Click here to download our Quick Start Guide to Installing Ceph.

Latest White Papers

Is Your CAD System Letting You Down?

Is Your CAD System Letting You Down?

When you outgrow your CAD system, it is time to upgrade to PTC Creo. Dive into this downloadable eBook to explore how one of the best CAD solutions on the market can change your product design for the better.What You Will Learn Discover how Creo users benefit from the...

Related Resources

The Journey of Software Integration

The Journey of Software Integration

When sharing multiple software across companies, it is important to ensure secure and seamless data delivery. Discover how Exalate helps integrate tools to enable a successful software integration journey.What You Will Learn In this eBook, you will explore: What a...

Why Teams Are Replacing HP ALM with Jira

Why Teams Are Replacing HP ALM with Jira

For years, HP Application Lifecycle Management (formerly known as Quality Center) was a go-to solution for requirements, testing, and defect management. However, with product transitions and support timelines expiring, many organizations are reevaluating their...

Google Workspace vs. Microsoft 365: Which One Is Right for Your Team?

Google Workspace vs. Microsoft 365: Which One Is Right for Your Team?

When it comes to powering modern workplaces, two productivity giants dominate the landscape: Google Workspace and Microsoft 365. Both offer robust suites of collaboration, communication, and productivity tools. But deciding which platform is best for your team depends...