Evaluating Kubernetes Pod Scaling Techniques for Event-Driven Applications
Keywords:
Kubernetes, Horizontal Pod AutoscalingAbstract
As cloud-native architectures continue to gain momentum, Kubernetes has become a cornerstone for managing containerized applications, offering flexibility and scalability. Among the various types of workloads, event-driven applications, which respond to asynchronous events, present unique challenges due to their dynamic and unpredictable nature. These applications require high elasticity to efficiently manage fluctuating workloads, scaling up during bursts of activity and scaling down during periods of low demand. Kubernetes provides a range of pod scaling techniques, two of the most prominent being Horizontal Pod Autoscaling (HPA) & custom metrics-based scaling. HPA, which automatically adjusts the number of pods based on resource usage such as CPU and memory, is well-suited for applications with predictable scaling needs. However, event-driven systems often experience irregular load patterns, making HPA’s reliance on standard metrics less ideal for every scenario. In these cases, custom metrics-based scaling offers a more precise solution by allowing users to scale based on specific metrics such as event queue length or application-specific performance indicators, ensuring that the system can handle workloads more efficiently. Amazon Elastic Kubernetes Service (EKS), which provides a fully managed Kubernetes environment, supports both of these scaling methods, enabling users to leverage the full capabilities of Kubernetes for event-driven workloads. The choice between HPA and custom metrics-based scaling depends on the specific requirements of the event-driven application, including factors such as the type of events processed, the frequency of events, and the need for rapid scaling. HPA offers simplicity and ease of use, but its reliance on basic resource metrics can lead to under or over-scaling in more complex use cases. On the other hand, custom metrics scaling provides more granular control but requires additional configuration and monitoring. This paper evaluates the strengths & limitations of these pod scaling techniques within EKS for event-driven applications, giving practical insights into which method is best suited for different workloads.
Downloads
References
Mohanty, S. (2018). Evaluation of serverless computing frameworks based on kubernetes (Master's thesis).
Trnkoczy, J., Pašcinski, U., Gec, S., & Stankovski, V. (2017, September). SWITCH-ing from multi-tenant to event-driven videoconferencing services. In 2017 IEEE 2nd International Workshops on Foundations and Applications of Self* Systems (FAS* W) (pp. 219-226). IEEE.
Chelliah, P. R., Naithani, S., & Singh, S. (2018). Practical Site Reliability Engineering: Automate the process of designing, developing, and delivering highly reliable apps and services with SRE. Packt Publishing Ltd.
Gjorgjeski, N., & Jurič, M. (2016). Complex event processing for integration of internet of things devices (Doctoral dissertation, Bachelor’s thesis: Undergraduate university study programme computer and information science).
Lee, H., Satyam, K., & Fox, G. (2018, July). Evaluation of production serverless computing environments. In 2018 IEEE 11th International Conference on Cloud Computing (CLOUD) (pp. 442-450). IEEE.
Pérez, A., Moltó, G., Caballer, M., & Calatrava, A. (2018). Serverless computing for container-based architectures. Future Generation Computer Systems, 83, 50-59.
Bila, N., Dettori, P., Kanso, A., Watanabe, Y., & Youssef, A. (2017, June). Leveraging the serverless architecture for securing linux containers. In 2017 IEEE 37th International Conference on Distributed Computing Systems Workshops (ICDCSW) (pp. 401-404). IEEE.
Lv, K., Zhao, Z., Rao, R., Hong, P., & Zhang, X. (2016, December). PCCTE: A portable component conformance test environment based on container cloud for avionics software development. In 2016 International Conference on Progress in Informatics and Computing (PIC) (pp. 664-668). IEEE.
Nadgowda, S., Suneja, S., & Isci, C. (2017). Paracloud: bringing application insight into cloud operations. In 9th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 17).
Soltani, B., Ghenai, A., & Zeghib, N. (2018). Towards distributed containerized serverless architecture in multi cloud environment. Procedia computer science, 134, 121-128.
Manchana, R. (2017). Optimizing Material Management through Advanced System Integration, Control Bus, and Scalable Architecture. International Journal of Scientific Research and Engineering Trends, 3, 239-246.
Baldini, I., Cheng, P., Fink, S. J., Mitchell, N., Muthusamy, V., Rabbah, R., ... & Tardieu, O. (2017, October). The serverless trilemma: Function composition for serverless computing. In Proceedings of the 2017 ACM SIGPLAN International Symposium on New Ideas, New Paradigms, and Reflections on Programming and Software (pp. 89-103).
Paščinski, U., Trnkoczy, J., Stankovski, V., Cigale, M., & Gec, S. (2018). QoS-Aware Orchestration of Network Intensive Software Utilities within Software Defined Data Centres: An Architecture and Implementation of a Global Cluster Manager. Journal of Grid Computing, 16, 85-112.
Kumar, M. (2018). Serverless computing for the Internet of Things.
Simioni, A. (2017). Implementation and evaluation of a container-based software architecture.
Komandla, V. Transforming Financial Interactions: Best Practices for Mobile Banking App Design and Functionality to Boost User Engagement and Satisfaction.
Gade, K. R. (2018). Real-Time Analytics: Challenges and Opportunities. Innovative Computer Sciences Journal, 4(1).
Gade, K. R. (2017). Integrations: ETL vs. ELT: Comparative analysis and best practices. Innovative Computer Sciences Journal, 3(1).
Naresh Dulam. The Rise of Kubernetes: Managing Containers in Distributed Systems. Distributed Learning and Broad Applications in Scientific Research, vol. 1, July 2015, pp. 73-94
Naresh Dulam. DataOps: Streamlining Data Management for Big Data and Analytics . Distributed Learning and Broad Applications in Scientific Research, vol. 2, Oct. 2016, pp. 28-50
Naresh Dulam, et al. Kubernetes Gains Traction: Orchestrating Data Workloads. Distributed Learning and Broad Applications in Scientific Research, vol. 3, May 2017, pp. 69-93
Naresh Dulam, et al. Snowflake Vs Redshift: Which Cloud Data Warehouse Is Right for You? . Distributed Learning and Broad Applications in Scientific Research, vol. 4, Oct. 2018, pp. 221-40
Downloads
Published
Issue
Section
License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
License Terms
Ownership and Licensing:
Authors of research papers submitted to Distributed Learning and Broad Applications in Scientific Research retain the copyright of their work while granting the journal certain rights. Authors maintain ownership of the copyright and have granted the journal a right of first publication. Simultaneously, authors agree to license their research papers under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) License.
License Permissions:
Under the CC BY-NC-SA 4.0 License, others are permitted to share and adapt the work, as long as proper attribution is given to the authors and acknowledgement is made of the initial publication in the journal. This license allows for the broad dissemination and utilization of research papers.
Additional Distribution Arrangements:
Authors are free to enter into separate contractual arrangements for the non-exclusive distribution of the journal's published version of the work. This may include posting the work to institutional repositories, publishing it in journals or books, or other forms of dissemination. In such cases, authors are requested to acknowledge the initial publication of the work in this journal.
Online Posting:
Authors are encouraged to share their work online, including in institutional repositories, disciplinary repositories, or on their personal websites. This permission applies both prior to and during the submission process to the journal. Online sharing enhances the visibility and accessibility of the research papers.
Responsibility and Liability:
Authors are responsible for ensuring that their research papers do not infringe upon the copyright, privacy, or other rights of any third party. Scientific Research Canada disclaims any liability or responsibility for any copyright infringement or violation of third-party rights in the research papers.
If you have any questions or concerns regarding these license terms, please contact us at editor@dlabi.org.