A novel weight normalization technique to improve Generative Adversarial Network training
Keywords:
Weight Normalization, Generative Adversarial Networks
Abstract
Generative Adversarial Networks (GANs) have emerged as a groundbreaking framework for generating realistic data across various domains. Yet their training remains difficult to manage due to mode collapse and instability. This paper introduces a novel weight normalization technique designed to improve GAN training by accelerating convergence and raising overall model performance. Traditional approaches often rely on simple weight scaling or standard normalization methods that do not fully address the unique challenges posed by adversarial training dynamics. Our proposed technique applies a more tailored normalization strategy that adapts to the evolving distribution of weights during training, ensuring more consistent gradient flow and better representational capacity. Through extensive experimentation, we demonstrate that our weight normalization approach significantly reduces the variance of generated samples, leading to higher-fidelity outputs and a more stable training process. We also provide a comprehensive analysis of the impact of weight normalization on both the generator and discriminator networks, highlighting its effectiveness in mitigating common pitfalls of GAN training. Our findings suggest that integrating this technique both enhances the quality of generated samples and smooths the training process, making it easier for practitioners to deploy GANs in real-world applications. This work contributes to ongoing efforts to refine GAN architectures and training methodologies, offering a promising avenue for further research in generative modeling. By presenting a fresh perspective on weight normalization, we aim to inspire subsequent advances in the field, ultimately broadening the scope and applicability of GANs across various industries.
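To make the role of weight normalization in adversarial training concrete, the sketch below applies the baseline reparameterization of Salimans and Kingma (2016), w = g * v / ||v||, to a small generator and discriminator in PyTorch. The adaptive variant proposed in this paper is not specified in the abstract, and all layer sizes, learning rates, and the placeholder data batch are illustrative assumptions.

# Minimal sketch (not the authors' exact method): standard weight
# normalization applied to both GAN sub-networks in PyTorch.
import torch
import torch.nn as nn
from torch.nn.utils import weight_norm

# Generator: maps a 100-d latent vector to a flattened 28x28 sample.
generator = nn.Sequential(
    weight_norm(nn.Linear(100, 256)),   # reparameterizes w as g * v / ||v||
    nn.ReLU(),
    weight_norm(nn.Linear(256, 784)),
    nn.Tanh(),
)

# Discriminator: maps a flattened sample to a real/fake logit.
discriminator = nn.Sequential(
    weight_norm(nn.Linear(784, 256)),
    nn.LeakyReLU(0.2),
    weight_norm(nn.Linear(256, 1)),
)

bce = nn.BCEWithLogitsLoss()
opt_d = torch.optim.Adam(discriminator.parameters(), lr=2e-4)
opt_g = torch.optim.Adam(generator.parameters(), lr=2e-4)

real = torch.rand(64, 784)      # placeholder "real" batch for illustration
z = torch.randn(64, 100)
fake = generator(z)

# Discriminator step: push real samples toward 1 and fake samples toward 0.
d_loss = bce(discriminator(real), torch.ones(64, 1)) + \
         bce(discriminator(fake.detach()), torch.zeros(64, 1))
opt_d.zero_grad(); d_loss.backward(); opt_d.step()

# Generator step: try to make the discriminator label fakes as real.
g_loss = bce(discriminator(fake), torch.ones(64, 1))
opt_g.zero_grad(); g_loss.backward(); opt_g.step()

Decoupling each weight vector's direction from its magnitude in this way is the property that the abstract's tailored strategy builds on; how the normalization is adapted to the evolving weight distribution during training is left to the full paper.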
References
Salimans, T., & Kingma, D. P. (2016). Weight normalization: A simple reparameterization to accelerate training of deep neural networks. Advances in neural information processing systems, 29.
Roth, K., Lucchi, A., Nowozin, S., & Hofmann, T. (2017). Stabilizing training of generative adversarial networks through regularization. Advances in neural information processing systems, 30.
Salimans, T., Goodfellow, I., Zaremba, W., Cheung, V., Radford, A., & Chen, X. (2016). Improved techniques for training GANs. Advances in neural information processing systems, 29.
Kadurin, A., Nikolenko, S., Khrabrov, K., Aliper, A., & Zhavoronkov, A. (2017). druGAN: An advanced generative adversarial autoencoder model for de novo generation of new molecules with desired molecular properties in silico. Molecular Pharmaceutics, 14(9), 3098-3104.
Ba, J. L., Kiros, J. R., & Hinton, G. E. (2016). Layer normalization. arXiv preprint arXiv:1607.06450.
Goodfellow, I. (2016). NIPS 2016 tutorial: Generative adversarial networks. arXiv preprint arXiv:1701.00160.
Hayes, J., Melis, L., Danezis, G., & De Cristofaro, E. (2017). LOGAN: Evaluating information leakage of generative models using generative adversarial networks. arXiv preprint arXiv:1705.07663.
Brock, A., Lim, T., Ritchie, J. M., & Weston, N. (2016). Neural photo editing with introspective adversarial networks. arXiv preprint arXiv:1609.07093.
Yang, G., Yu, S., Dong, H., Slabaugh, G., Dragotti, P. L., Ye, X., ... & Firmin, D. (2017). DAGAN: Deep de-aliasing generative adversarial networks for fast compressed sensing MRI reconstruction. IEEE Transactions on Medical Imaging, 37(6), 1310-1321.
Li, C., & Wand, M. (2016). Precomputed real-time texture synthesis with Markovian generative adversarial networks. In Computer Vision - ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part III (pp. 702-716). Springer International Publishing.
Bousmalis, K., Silberman, N., Dohan, D., Erhan, D., & Krishnan, D. (2017). Unsupervised pixel-level domain adaptation with generative adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3722-3731).
Jetchev, N., Bergmann, U., & Vollgraf, R. (2016). Texture synthesis with spatial generative adversarial networks. arXiv preprint arXiv:1611.08207.
Mahapatra, D., & Bozorgtabar, B. (2017). Retinal vasculature segmentation using local saliency maps and generative adversarial networks for image super resolution. arXiv preprint arXiv:1710.04783.
Mun, S., Park, S., Han, D. K., & Ko, H. (2017, September). Generative Adversarial Network Based Acoustic Scene Training Set Augmentation and Selection Using SVM Hyper-Plane. In DCASE (pp. 93-102).
Wang, D., & Liu, Q. (2016). Learning to draw samples: With application to amortized MLE for generative adversarial learning. arXiv preprint arXiv:1611.01722.
Gade, K. R. (2018). Real-Time Analytics: Challenges and Opportunities. Innovative Computer Sciences Journal, 4(1).
Komandla, V. Transforming Financial Interactions: Best Practices for Mobile Banking App Design and Functionality to Boost User Engagement and Satisfaction.
Gade, K. R. (2017). Integrations: ETL/ELT, Data Integration Challenges, Integration Patterns. Innovative Computer Sciences Journal, 3(1).
Gade, K. R. (2017). Migrations: Challenges and Best Practices for Migrating Legacy Systems to Cloud-Based Platforms. Innovative Computer Sciences Journal, 3(1).
License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
License Terms
Ownership and Licensing:
Authors of research papers submitted to Distributed Learning and Broad Applications in Scientific Research retain the copyright of their work while granting the journal certain rights. Authors maintain ownership of the copyright and have granted the journal a right of first publication. Simultaneously, authors agree to license their research papers under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) License.
License Permissions:
Under the CC BY-NC-SA 4.0 License, others are permitted to share and adapt the work, as long as proper attribution is given to the authors and acknowledgement is made of the initial publication in the journal. This license allows for the broad dissemination and utilization of research papers.
Additional Distribution Arrangements:
Authors are free to enter into separate contractual arrangements for the non-exclusive distribution of the journal's published version of the work. This may include posting the work to institutional repositories, publishing it in journals or books, or other forms of dissemination. In such cases, authors are requested to acknowledge the initial publication of the work in this journal.
Online Posting:
Authors are encouraged to share their work online, including in institutional repositories, disciplinary repositories, or on their personal websites. This permission applies both prior to and during the submission process to the journal. Online sharing enhances the visibility and accessibility of the research papers.
Responsibility and Liability:
Authors are responsible for ensuring that their research papers do not infringe upon the copyright, privacy, or other rights of any third party. Scientific Research Canada disclaims any liability or responsibility for any copyright infringement or violation of third-party rights in the research papers.
If you have any questions or concerns regarding these license terms, please contact us at editor@dlabi.org.