Evaluating Kubernetes Pod Scaling Techniques for Event-Driven Applications

Authors

  • Babulal Shaik Cloud Solutions Architect at Amazon Web Services, USA Author

Keywords:

Kubernetes, Horizontal Pod Autoscaling

Abstract

As cloud-native architectures continue to gain momentum, Kubernetes has become a cornerstone for managing containerized applications, offering flexibility and scalability. Among the various types of workloads, event-driven applications, which respond to asynchronous events, present unique challenges due to their dynamic and unpredictable nature. These applications require high elasticity to efficiently manage fluctuating workloads, scaling up during bursts of activity and scaling down during periods of low demand. Kubernetes provides a range of pod scaling techniques, two of the most prominent being Horizontal Pod Autoscaling (HPA) & custom metrics-based scaling. HPA, which automatically adjusts the number of pods based on resource usage such as CPU and memory, is well-suited for applications with predictable scaling needs. However, event-driven systems often experience irregular load patterns, making HPA’s reliance on standard metrics less ideal for every scenario. In these cases, custom metrics-based scaling offers a more precise solution by allowing users to scale based on specific metrics such as event queue length or application-specific performance indicators, ensuring that the system can handle workloads more efficiently. Amazon Elastic Kubernetes Service (EKS), which provides a fully managed Kubernetes environment, supports both of these scaling methods, enabling users to leverage the full capabilities of Kubernetes for event-driven workloads. The choice between HPA and custom metrics-based scaling depends on the specific requirements of the event-driven application, including factors such as the type of events processed, the frequency of events, and the need for rapid scaling. HPA offers simplicity and ease of use, but its reliance on basic resource metrics can lead to under or over-scaling in more complex use cases. On the other hand, custom metrics scaling provides more granular control but requires additional configuration and monitoring. This paper evaluates the strengths & limitations of these pod scaling techniques within EKS for event-driven applications, giving practical insights into which method is best suited for different workloads.

Downloads

Download data is not yet available.

References

Mohanty, S. (2018). Evaluation of serverless computing frameworks based on kubernetes (Master's thesis).

Trnkoczy, J., Pašcinski, U., Gec, S., & Stankovski, V. (2017, September). SWITCH-ing from multi-tenant to event-driven videoconferencing services. In 2017 IEEE 2nd International Workshops on Foundations and Applications of Self* Systems (FAS* W) (pp. 219-226). IEEE.

Chelliah, P. R., Naithani, S., & Singh, S. (2018). Practical Site Reliability Engineering: Automate the process of designing, developing, and delivering highly reliable apps and services with SRE. Packt Publishing Ltd.

Gjorgjeski, N., & Jurič, M. (2016). Complex event processing for integration of internet of things devices (Doctoral dissertation, Bachelor’s thesis: Undergraduate university study programme computer and information science).

Lee, H., Satyam, K., & Fox, G. (2018, July). Evaluation of production serverless computing environments. In 2018 IEEE 11th International Conference on Cloud Computing (CLOUD) (pp. 442-450). IEEE.

Pérez, A., Moltó, G., Caballer, M., & Calatrava, A. (2018). Serverless computing for container-based architectures. Future Generation Computer Systems, 83, 50-59.

Bila, N., Dettori, P., Kanso, A., Watanabe, Y., & Youssef, A. (2017, June). Leveraging the serverless architecture for securing linux containers. In 2017 IEEE 37th International Conference on Distributed Computing Systems Workshops (ICDCSW) (pp. 401-404). IEEE.

Lv, K., Zhao, Z., Rao, R., Hong, P., & Zhang, X. (2016, December). PCCTE: A portable component conformance test environment based on container cloud for avionics software development. In 2016 International Conference on Progress in Informatics and Computing (PIC) (pp. 664-668). IEEE.

Nadgowda, S., Suneja, S., & Isci, C. (2017). Paracloud: bringing application insight into cloud operations. In 9th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 17).

Soltani, B., Ghenai, A., & Zeghib, N. (2018). Towards distributed containerized serverless architecture in multi cloud environment. Procedia computer science, 134, 121-128.

Manchana, R. (2017). Optimizing Material Management through Advanced System Integration, Control Bus, and Scalable Architecture. International Journal of Scientific Research and Engineering Trends, 3, 239-246.

Baldini, I., Cheng, P., Fink, S. J., Mitchell, N., Muthusamy, V., Rabbah, R., ... & Tardieu, O. (2017, October). The serverless trilemma: Function composition for serverless computing. In Proceedings of the 2017 ACM SIGPLAN International Symposium on New Ideas, New Paradigms, and Reflections on Programming and Software (pp. 89-103).

Paščinski, U., Trnkoczy, J., Stankovski, V., Cigale, M., & Gec, S. (2018). QoS-Aware Orchestration of Network Intensive Software Utilities within Software Defined Data Centres: An Architecture and Implementation of a Global Cluster Manager. Journal of Grid Computing, 16, 85-112.

Kumar, M. (2018). Serverless computing for the Internet of Things.

Simioni, A. (2017). Implementation and evaluation of a container-based software architecture.

Komandla, V. Transforming Financial Interactions: Best Practices for Mobile Banking App Design and Functionality to Boost User Engagement and Satisfaction.

Gade, K. R. (2018). Real-Time Analytics: Challenges and Opportunities. Innovative Computer Sciences Journal, 4(1).

Gade, K. R. (2017). Integrations: ETL vs. ELT: Comparative analysis and best practices. Innovative Computer Sciences Journal, 3(1).

Naresh Dulam. The Rise of Kubernetes: Managing Containers in Distributed Systems. Distributed Learning and Broad Applications in Scientific Research, vol. 1, July 2015, pp. 73-94

Naresh Dulam. DataOps: Streamlining Data Management for Big Data and Analytics . Distributed Learning and Broad Applications in Scientific Research, vol. 2, Oct. 2016, pp. 28-50

Naresh Dulam, et al. Kubernetes Gains Traction: Orchestrating Data Workloads. Distributed Learning and Broad Applications in Scientific Research, vol. 3, May 2017, pp. 69-93

Naresh Dulam, et al. Snowflake Vs Redshift: Which Cloud Data Warehouse Is Right for You? . Distributed Learning and Broad Applications in Scientific Research, vol. 4, Oct. 2018, pp. 221-40

Downloads

Published

04-09-2019

How to Cite

[1]
Babulal Shaik, “Evaluating Kubernetes Pod Scaling Techniques for Event-Driven Applications ”, Distrib Learn Broad Appl Sci Res, vol. 5, pp. 1333–1350, Sep. 2019, Accessed: Jan. 03, 2025. [Online]. Available: https://dlabi.org/index.php/journal/article/view/259

Most read articles by the same author(s)

Similar Articles

11-16 of 16

You may also start an advanced similarity search for this article.