NAS-Bench-Suite - NAS Evaluation is (Now) Surprisingly Easy

[Abstract]

[1. Introduction]

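→ NAS methods are hard to compare fairly

→ Benchmark datasets were introduced

→ Early benchmarks were limited to small search spaces or to image classification

→ Benchmark datasets are inconsistent with one another (abstractions, capabilities, implementations)

⇒ Presents a meaningful analysis

⇒ Presents NAS-Bench-Suite (unified interface)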

[2. NAS Benchmarks Overview]

Most search spaces have two levels:
- cell-based (micro) structure: each cell takes the form of a DAG
- macro structure: the architecture skeleton, i.e. the arrangement of cells, such as how many times each cell is duplicated
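A minimal sketch of the micro/macro split, to make the two levels concrete. The operation names, the `Cell` class, and `build_macro` are hypothetical illustrations and do not come from any particular benchmark.

```python
from dataclasses import dataclass

# Hypothetical operation set; each real benchmark defines its own.
OPS = ("conv3x3", "conv1x1", "skip", "none")

@dataclass(frozen=True)
class Cell:
    """Micro level: a cell as a DAG with an operation on each edge."""
    num_nodes: int
    edges: tuple  # pairs of ((src, dst), op)

    def is_valid(self) -> bool:
        # Requiring src < dst on every edge rules out cycles.
        return all(
            0 <= src < dst < self.num_nodes and op in OPS
            for (src, dst), op in self.edges
        )

def build_macro(cells_per_stage: int, num_stages: int) -> list:
    """Macro level: a skeleton in which every "cell" slot reuses the same cell."""
    layers = ["stem"]
    for stage in range(num_stages):
        layers += ["cell"] * cells_per_stage
        if stage < num_stages - 1:
            layers.append("reduction")  # assumed downsampling between stages
    layers.append("classifier")
    return layers

cell = Cell(
    num_nodes=4,
    edges=(
        ((0, 1), "conv3x3"),
        ((0, 2), "skip"),
        ((1, 3), "conv1x1"),
        ((2, 3), "conv3x3"),
    ),
)
skeleton = build_macro(cells_per_stage=2, num_stages=3)
```

Searching over `Cell.edges` changes the micro structure; changing `cells_per_stage` or `num_stages` changes the macro structure.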

[3. NAS Benchmarks Statistics]

NAS-Bench-Suite: the first large-scale aggregation of statistics computed on NAS Benchmarks
- assess the level of locality (similarity of validation accuracy among neighboring architectures)
  
  ⇒ NAS-Bench-201 ImageNet and CIFAR-10 show the highest autocorrelation
diversity
- DARTS has a lower std than NAS-Bench-201, so finding a 0.1%-optimal architecture on NAS-Bench-201 would be the more impressive result
- Locality and neighborhood size may also affect how difficult a NAS benchmark is
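One common way to quantify locality is the autocorrelation of accuracies along a random walk through the search space. The sketch below assumes a toy search space and two synthetic accuracy functions (`smooth_accuracy`, `rugged_accuracy`); none of these are real benchmark data, and the exact statistic used in the paper may differ.

```python
import random

NUM_EDGES, NUM_OPS = 6, 5  # assumed toy sizes

def random_neighbor(arch, rng):
    """Change the operation on one randomly chosen edge."""
    i = rng.randrange(len(arch))
    new_op = rng.choice([op for op in range(NUM_OPS) if op != arch[i]])
    return arch[:i] + (new_op,) + arch[i + 1:]

def autocorrelation(values, lag=1):
    """Sample autocorrelation of a sequence at the given lag."""
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values)
    if var == 0:
        return 0.0
    cov = sum((values[t] - mean) * (values[t + lag] - mean) for t in range(n - lag))
    return cov / var

def random_walk_autocorrelation(accuracy, steps=2000, lag=1, seed=0):
    """Walk between neighboring architectures and correlate their accuracies."""
    rng = random.Random(seed)
    arch = tuple(rng.randrange(NUM_OPS) for _ in range(NUM_EDGES))
    accs = []
    for _ in range(steps):
        accs.append(accuracy(arch))
        arch = random_neighbor(arch, rng)
    return autocorrelation(accs, lag)

def smooth_accuracy(arch):
    """Synthetic landscape where neighbors have similar accuracy."""
    return sum(arch) / (NUM_EDGES * (NUM_OPS - 1))

def rugged_accuracy(arch):
    """Synthetic landscape where neighbors are unrelated."""
    return random.Random(repr(arch)).random()

smooth = random_walk_autocorrelation(smooth_accuracy)
rugged = random_walk_autocorrelation(rugged_accuracy)
```

A value near 1 means neighbors have similar accuracy (high locality); a value near 0 means accuracy is essentially unrelated between neighbors.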

[4. On the Generalizability of NAS Algorithms]

5 black-box algorithms
- iteratively chooses architectures to train
- uses the final validation accuracies
- random search, regularized evolution, local search, BANANAS, NPENAS
5 performance predictors
- predict the performance of untrained architectures by training a model on a set of already evaluated architectures
- BOHAMIANN, Gaussian process, random forest, neural architecture optimization, XGBoost
3 one-shot methods
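The black-box loop described above (pick an architecture, read its final validation accuracy, repeat) can be sketched with two of the simpler algorithms. `toy_val_accuracy` is a synthetic stand-in for a tabular benchmark lookup, and the local search shown here is a plain first-improvement hill climber, which may differ from the variant evaluated in the paper.

```python
import random

NUM_EDGES, NUM_OPS = 4, 3  # assumed toy search space: 3**4 = 81 architectures

def toy_val_accuracy(arch):
    """Synthetic stand-in for a tabular benchmark lookup (not real data)."""
    return sum((op + 1) * (i + 1) for i, op in enumerate(arch)) / 30.0

def random_arch(rng):
    return tuple(rng.randrange(NUM_OPS) for _ in range(NUM_EDGES))

def neighbors(arch):
    """All architectures that differ from `arch` on exactly one edge."""
    for i in range(len(arch)):
        for op in range(NUM_OPS):
            if op != arch[i]:
                yield arch[:i] + (op,) + arch[i + 1:]

def random_search(query, budget, rng):
    """Sample architectures independently and keep the best one seen."""
    best_arch, best_acc = None, float("-inf")
    for _ in range(budget):
        arch = random_arch(rng)
        acc = query(arch)
        if acc > best_acc:
            best_arch, best_acc = arch, acc
    return best_arch, best_acc

def local_search(query, budget, rng):
    """Move to the first better neighbor until none is found or budget runs out."""
    arch = random_arch(rng)
    acc, used = query(arch), 1
    improved = True
    while improved and used < budget:
        improved = False
        for nb in neighbors(arch):
            if used >= budget:
                break
            nb_acc = query(nb)
            used += 1
            if nb_acc > acc:
                arch, acc, improved = nb, nb_acc, True
                break
    return arch, acc

rs_arch, rs_acc = random_search(toy_val_accuracy, budget=100, rng=random.Random(0))
ls_arch, ls_acc = local_search(toy_val_accuracy, budget=100, rng=random.Random(0))
```

Both functions touch the benchmark only through `query`, which is what makes them black-box: swapping in a different search space only requires a different `query`, `random_arch`, and `neighbors`.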

⇒ assess 3 assumptions (refer to introduction)

[4.1 The Best NAS Methods]

black-box algorithms
- no algorithm performs well across all search spaces
performance predictor
- with default parameters: RF
- with tuned parameters: XGBoost
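A rough sketch of how a performance predictor is fit and scored. To stay dependency-free it uses a k-nearest-neighbor predictor over Hamming distance, which is a substitute for illustration and not the random forest or XGBoost models named above. The accuracy function is synthetic, and ranking quality is measured with Kendall's tau (tau-a form, ties count as neither concordant nor discordant).

```python
import random

NUM_EDGES, NUM_OPS = 6, 4  # assumed toy search space

def toy_val_accuracy(arch):
    """Synthetic stand-in for a benchmark's validation accuracy."""
    return sum(arch) / (NUM_EDGES * (NUM_OPS - 1))

def random_arch(rng):
    return tuple(rng.randrange(NUM_OPS) for _ in range(NUM_EDGES))

def hamming(a, b):
    """Number of edges on which two architectures differ."""
    return sum(x != y for x, y in zip(a, b))

class KNNPredictor:
    """Predict accuracy as the mean over the k closest evaluated architectures."""

    def __init__(self, k=5):
        self.k = k
        self.archs, self.accs = [], []

    def fit(self, archs, accs):
        self.archs, self.accs = list(archs), list(accs)

    def predict(self, arch):
        ranked = sorted(zip(self.archs, self.accs), key=lambda p: hamming(arch, p[0]))
        nearest = ranked[: self.k]
        return sum(acc for _, acc in nearest) / len(nearest)

def kendall_tau(xs, ys):
    """Kendall rank correlation (tau-a) between two equally long sequences."""
    concordant = discordant = 0
    for i in range(len(xs)):
        for j in range(i + 1, len(xs)):
            sign = (xs[i] - xs[j]) * (ys[i] - ys[j])
            if sign > 0:
                concordant += 1
            elif sign < 0:
                discordant += 1
    pairs = len(xs) * (len(xs) - 1) / 2
    return (concordant - discordant) / pairs

rng = random.Random(0)
train = [random_arch(rng) for _ in range(200)]  # already evaluated architectures
test = [random_arch(rng) for _ in range(50)]    # architectures to rank

predictor = KNNPredictor(k=5)
predictor.fit(train, [toy_val_accuracy(a) for a in train])
tau = kendall_tau(
    [toy_val_accuracy(a) for a in test],
    [predictor.predict(a) for a in test],
)
```

Comparing predictors with default versus tuned hyperparameters amounts to repeating this fit-and-score step per search space and comparing the resulting rank correlations.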

[4.2 Generalizability of Hyperparameters]

[4.3 One-shot Algorithms]
