Abstract

Introduction

Until now, the relationships among grouped point sets have not been well defined.
Use a complete graph to obtain a better local representation, and use a KNN graph to help establish relationships between the sets.

The SVGA-Net architecture consists mainly of two parts: the Voxel-graph network and Sparse-to-dense regression.

β_m is the attention score over the neighboring nodes obtained from the KNN graph. "And the final β_m is the average of K neighbors." -> not sure what this means

(a) local complete graph: nodes inside the same voxel are aggregated according to their attention scores
(b) global KNN graph: shown as a 3-NN graph, where the arrow direction is the propagation direction. Nodes derived from the voxels are aggregated according to their attention scores
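
To make (a) and (b) concrete, here is a minimal sketch of attention-weighted aggregation on both graphs, in plain Python. The scoring function (softmax over dot products), the inclusion of each node in its own complete-graph neighborhood, and all function names are assumptions made for illustration; the notes above do not state how SVGA-Net actually computes its attention scores.

```python
import math

def softmax(scores):
    # Subtract the max for numerical stability before exponentiating.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attention_aggregate(center, neighbors):
    # Assumed scoring: softmax over dot products between center and neighbors.
    weights = softmax([dot(center, n) for n in neighbors])
    return [sum(w * n[d] for w, n in zip(weights, neighbors))
            for d in range(len(center))]

def local_complete_graph(points):
    # (a) every node in a voxel attends to every node in that voxel
    # (self included here, which is an assumption).
    return [attention_aggregate(p, points) for p in points]

def knn_indices(centers, k):
    # For each voxel center, indices of the k nearest other centers.
    result = []
    for i, c in enumerate(centers):
        others = [j for j in range(len(centers)) if j != i]
        others.sort(key=lambda j: math.dist(c, centers[j]))
        result.append(others[:k])
    return result

def global_knn_graph(voxel_feats, centers, k=3):
    # (b) each voxel node aggregates features from its k nearest voxels.
    nbrs = knn_indices(centers, k)
    return [attention_aggregate(voxel_feats[i], [voxel_feats[j] for j in nbrs[i]])
            for i in range(len(voxel_feats))]
```

Usage: call `local_complete_graph` on the point features of one voxel, reduce them to one feature per voxel (for example by max pooling, again an assumption), then pass those features with the voxel centers to `global_knn_graph`.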

Block: Conv(f_in, f_out, k, s, p) -> input channels, output channels, kernel size, stride, padding size
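
As a quick check on how k, s, and p affect the feature map size, the standard convolution output-size formula can be written as below. The helper names are hypothetical, and the parameter count assumes a 2D convolution with a square kernel and a bias term, which the notes do not confirm for SVGA-Net.

```python
def conv_output_size(n, k, s, p):
    # Standard formula: floor((n + 2p - k) / s) + 1 along one spatial axis.
    return (n + 2 * p - k) // s + 1

def conv_param_count(f_in, f_out, k):
    # Assumes a 2D conv with a k x k kernel and one bias per output channel.
    return f_in * f_out * k * k + f_out
```

For example, with k=3 and p=1, a stride of 1 keeps the spatial size and a stride of 2 halves it.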
High-resolution features and low-resolution features are combined. (the effect of a pyramid network?)
In this way, the dense feature range of the lower level can be well combined with the sparse feature range of the higher level.
Then, after upsampling and a pass through the CNN, a feature map F of the same size is produced.
The original sparse feature map b is combined element-wise with F. (this is supposed to make the features denser -> ?)
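
The upsample-then-combine step can be sketched on small 2D grids as follows. Two things here are assumptions and not taken from the paper: nearest-neighbor upsampling, and addition as the element-wise operation (the notes only say "element-wise"). The function names are hypothetical.

```python
def upsample_nearest(fmap, factor):
    # Repeat every value `factor` times along both axes.
    out = []
    for row in fmap:
        wide = [v for v in row for _ in range(factor)]
        out.extend([list(wide) for _ in range(factor)])
    return out

def elementwise_add(a, b):
    # Assumed operation: addition. Both maps must have the same shape.
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("feature maps must have the same shape")
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def count_nonzero(fmap):
    return sum(1 for row in fmap for v in row if v != 0)

low_res = [[1, 2], [3, 4]]            # dense, low-resolution map
b = [[0, 0, 0, 5], [0, 0, 0, 0],
     [0, 0, 0, 0], [7, 0, 0, 0]]      # sparse, high-resolution map
F = upsample_nearest(low_res, 2)      # same size as b
fused = elementwise_add(b, F)
```

Under these assumptions, `b` has 2 non-zero cells out of 16 while `fused` has 16, which is one way to read the "denser" remark above.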
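ex) To address the problem mentioned above, SSD does not use low-level features and instead extracts feature maps starting from the midpoint of the convolutional network. However, the authors of the FPN paper point out that leaving these out is not appropriate, because high-resolution feature maps are useful for detecting small objects.

Experiments
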
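The paper explains the cases where performance is lower than Point-GNN as follows.
Local and global graph construction can capture better features, but for objects that are more than 80% occluded the local graph cannot be constructed, which the authors give as the reason.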
The slight inferiority in the two detection tasks may be due to the fact that the local graph cannot be constructed for objects with occlusion ratio exceeding 80%.