Python Deep Learning Projects

Abstract:

The performance of deep neural networks depends heavily on their architectures. Deep architectures are either designed manually by experts or searched automatically by Neural Architecture Search (NAS) methods. However, even a well-designed or well-searched architecture may still contain many insignificant or redundant modules/operations (e.g., intermediate convolution or pooling layers). Such redundancy wastes memory and computation and can even hurt performance. Thus, the operations of an architecture need to be optimized to improve performance without introducing extra computational cost. To this end, we propose a Neural Architecture Transformer (NAT) method, which casts the optimization problem as a Markov Decision Process (MDP) and replaces redundant operations with skip connections or null operations. However, NAT only considers a small number of possible replacements/transitions, and such a limited search space may hinder architecture optimization. To address this, we propose a Neural Architecture Transformer++ (NAT++) method that enlarges the set of candidate transitions. Specifically, we present a two-level transition rule to obtain valid transitions, allowing operations to be replaced with more efficient types (e.g., convolution $\to$ separable convolution) or smaller kernel sizes (e.g., $5\times 5 \to 3\times 3$). Because the valid transitions differ across operations, we further propose a Binary-Masked Softmax (BMSoftmax) layer to rule out invalid transitions. Finally, based on the MDP formulation, we learn the optimal policy via policy gradient and use it to infer the optimized architectures. The transformed architectures outperform both their original counterparts and those optimized by existing methods.
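To make the Binary-Masked Softmax idea concrete, below is a minimal illustrative sketch (not the authors' released implementation) of how invalid transitions could be masked out before the softmax, assuming a PyTorch setting. The function name `binary_masked_softmax`, the toy tensor shapes, and the example mask are hypothetical choices for illustration only.

```python
import torch

def binary_masked_softmax(logits, mask):
    """Softmax over candidate transitions with invalid ones masked out.

    logits: (num_ops, num_candidates) unnormalized policy scores.
    mask:   (num_ops, num_candidates) binary tensor; 1 marks a valid
            transition for that operation, 0 an invalid one.
    """
    # Push invalid transitions to -inf so they receive exactly zero probability.
    neg_inf = torch.finfo(logits.dtype).min
    masked_logits = torch.where(mask.bool(), logits, torch.full_like(logits, neg_inf))
    return torch.softmax(masked_logits, dim=-1)

# Toy example: 2 operations, each with 4 candidate transitions.
logits = torch.randn(2, 4)
mask = torch.tensor([[1., 1., 0., 1.],
                     [1., 0., 0., 1.]])
probs = binary_masked_softmax(logits, mask)  # each row sums to 1; masked entries are 0
# Sampling from the masked distribution can only pick valid transitions per operation.
actions = torch.distributions.Categorical(probs=probs).sample()
print(probs, actions)
```

Because invalid entries get zero probability, a policy sampling from this distribution can never choose a transition that the two-level rule forbids for a given operation.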

