Email Accessibility & Reliability - Combining Dovecot and Scality to Win Dan Shain & Jim Perry - Open-Xchange

Page created by Randall Coleman
 
CONTINUE READING
Email Accessibility & Reliability - Combining Dovecot and Scality to Win Dan Shain & Jim Perry - Open-Xchange
Email Accessibility & Reliability
Combining Dovecot and Scality to Win
Dan Shain & Jim Perry
Cloud Office R&D
Email Accessibility & Reliability - Combining Dovecot and Scality to Win Dan Shain & Jim Perry - Open-Xchange
Who are we?
•   Dan Shain
    ‣ Heads Engineering, Development, and
      Operations for Rackspace Applications Services
      Cloud Office Division
    ‣ 30+ years in the IT Realm with focus on large
      distributed systems

•   Jim Perry
    ‣ Technical Manager for Software Development
    ‣ Responsible for all things Dovecot
    ‣ 25 years in the software development and IT
      management
Email Accessibility & Reliability - Combining Dovecot and Scality to Win Dan Shain & Jim Perry - Open-Xchange
IN 2014 WE NEEDED TO
CHANGE OUR OPERATIONAL
        PARADIGM
Email Accessibility & Reliability - Combining Dovecot and Scality to Win Dan Shain & Jim Perry - Open-Xchange
Dilemma We Faced
Open Source Dovecot with filesystem based storage
                                                Current path would increase base product cost by 200%

                            Hardware Projections
    4500

    4000

    3500
                                                                                       CAPEX Costs
    3000

                                                                                        OPEX Costs
    2500

    2000

    1500

                                                                                         Logistical
    1000
                                                                                         Nightmare
     500

       0
                    2015   2016        2017        2018     2019

                                   Status Quo
4          Footer
Email Accessibility & Reliability - Combining Dovecot and Scality to Win Dan Shain & Jim Perry - Open-Xchange
Dovecot Deployment that was…
    4,500+ Assets across 3 Datacenters and in 5 physical zones

• Scaling           •   Compute and Storage could not scale independently
 Limitations        •   Complicated support processes and scripts

• Data Protection   •   N+2 Protection for Geo-redundant failures was not reliable and very costly

• Environment       •   Hardware management was a logistical nightmare

• Management        •   System Rebalancing was Managed Process

• Product           •   On boarding limitations caused by write width
 Impacts            •   Cost basis rising

   5
Email Accessibility & Reliability - Combining Dovecot and Scality to Win Dan Shain & Jim Perry - Open-Xchange
Our Use Case
Wide range of uses on a multi-tenant platform

                       •        Multi-tenant environment > 800,000 Domains
                       •        Wide variety of “Domain” level settings
                       •        Usage patterns, although normalized, have large variances

•           Normal Does not cause problems, “Abby Normal” does… we have extremes…that break the normal

                                Mailbox Distribution By                                                                    Average Messages                                                                      Accessed Everywhere
                                   Message Count                                                                      Sent/Received by Mailbox/Day
Mailbox Count

                                                                                                                                                                                    Sent

                                                                                                                                                                                    Received

                                                                                                              1   2    10000
                                                                                                                                                                                                 10,000
                < 10    < 100    < 500   < 5,000   > 3,000   > 20,000   > 200,000   > 1,000,000 > 2,000,000

6
                       Messages per Mailbox
Demands
       Product Team/Customers                     Engineering/Operations

o Happy Customers                         o Stay abreast of Vulnerabilities
o Improved cost margins                   o Reduce Operational Costs to run and
o Value Added Customer Support Contacts     maintain the business
 o No Down Time                           o Stabilize Capital Expenditures
 o Ensure Deliverability                  o Ease Supply Train Impacts
 o Zero Data Loss                         o Reduce/optimize Datacenter Space
o Faster Feature Delivery                 o Move to virtualized deployment models

7
DovecotPro – Scality Solution                                             Things WILL Fail
Why did we chose this solution?

    ‣ Access Patterns
                                                                               Disk
      •   Very High Write, High Read, Medium Delete
      •   Data after 10 days, stays… and stays                                 Node
    ‣ Customer Activity Profile
                                                                               Rack
      •   Business Class Usage (Extremely High Daily Active Percentage)
    ‣ N+2 Architecture                                                         Row
      •   Need multi-zone and Geo-Graphic managed solution
                                                                              Network
    ‣ Fault Segmentation (Hierarchy of failure events)
    ‣ Scaling                                                               Datacenter
      •   Must scale compute and storage separately
    ‣ Maintenance & Upgrade
    ‣ Knowledge of Dovecot
    ‣ Reputation of Vendors

8
Deployment                                  Data Center                             Data Center
Chosen
                                                         DoveCot Pro                   DoveCot Pro
                                                          Backend                       Backend

                           Dovecot/Scality Multi-Geo
                                                          Load Balancer                Load Balancer
                                                         (Local haproxy)              (Local haproxy)

                           Location Deployment
                   Cass andra                                                                            Cass andra

                                                                   sproxyd

                                                                                                                  Availability Zone I
     Availability Zone I
                                                               Scality Ring – Stretched over
                                                                       2 Data Centers                            Availability Zone II
     Availability Zone II

                   Cass andra                                                                               Cass andra
                                                          Load Balancer                 Load Balancer
                                                         (Local haproxy)               (Local haproxy)

                                                       DoveCot Pro                   DoveCot Pro
                                                        Backend                       Backend
Optimized Scality
Infrastructure
• Leverage Scality to Optimize Hardware
     ‣ Apollo 4510 platform
     ‣ Tremendous scaling options
     ‣ Gigabyte Cost optimization
     ‣ Normalized and consistent environment

• Key – We worked with Scality to
     create an optimized “Email
     Storage Ring”
     ‣ Focus on the nature of email access patterns as
       well as sizing
     ‣ Do not overload the use case (We are not
       creating an S3 like environment)

10   OX Summit
Footprint Reduction                                                                                   1400
                                                                                                                                            Hardware Repairs

• Over 2,000 Assets Decommissioned to date                                                            1200

• Avoided ordering over 1,440 Assets and opening 1 new server room                                    1000

• Vacating several rooms                                                                               800

• Reduced Maintenance to less than 35% of Peak (and dropping)                                          600

                                           Progress to Date
                                                                                                       400

2500
                                                                                                       200

                                                                                                         0
                                                                                                             Q1 2016   Q2 2016   Q3 2016   Q4 2016   Q1 2017   Q2 2017   Q3 2017   Q4 2017   Q1 2018
2000

1500

1000

 500

   0
       Sep-16   Dec-16   Mar-17   Jun-17    Sep-17       Dec-17   Mar-18   Jun-18   Sep-18   Dec-18
Storage Stability
• Scality survived hardware outages; Disk and Chassis
• Handled Massive Cassandra Rebuild Efforts
• Rack Moves & Room Reconfigurations
• Only 1 Scality Ring associated customer impacting event
 ‣ Root Cause related to HP Firmware defect
• Only a handful of special tuning needed to run to:
 ‣ Re-engage disks post hardware oddness
 ‣ Node release of expired data
• Expansion Scheduled for this Summer
CHALLENGES FACED
     ‣ Interplay between DovecotPro, Cassandra
       & Scality
     ‣ Early adoption woes                         “Optimist: Someone
     ‣ Initial errors in Dovecot Director Ring
       Configuration
                                                   who figures that taking
     ‣ Fault Segmentation                          a step backward
     ‣ Too many customized plugins on our behalf   after taking a step
     ‣ Need to improve path to upgrade             forward is not a
     ‣ Allowance needed for “Bake-In”              disaster, it's a cha-cha.”
     ‣ Tuning of the “Ring” – Overhead Reduction
     ‣ Expanded our internal knowledge base
                                                   - Robert Brault
13
THANK YOU
    ONE FANATICAL PLACE                        | SAN ANTONIO, TX 78218

    US SALES: 1-800-961-2888 | US SUPPORT: 1-800-961-4454 | WWW.RACKSPACE.COM

Copyright © 2018 Rackspace | Rackspace® Fanatical Support® and other Rackspace marks are either registered service marks or service marks of Rackspace US, Inc. in the United States and other countries. Features, benefits and pricing
        presented depend on system configuration and are subject to change without notice. Rackspace disclaims any representation, warranty or other legal commitment regarding its services except for those expressly stated
                 in a Rackspace services agreement. All other trademarks, service marks, images, products and brands
                                                                                                               14 remain the sole property of their respective holders and do not imply endorsement or sponsorship.
You can also read