madeye
2017-03-22 13:31:17 +08:00
We're hiring again, and this time for the following three areas at once:
High performance CUDA kernel development
Develop super-fast kernels for cuDNN and TensorRT
Requirements
CUDA programming and optimization
Assembly level optimization with SSE, AVX, or other SIMD instructions
Compiler
General compute architect
Develop methodology and evaluate compute features for future architecture
Model deep learning performance for future architecture
Requirements
Deep understanding of compute architecture in general
Good programming skills
DL algorithm
Analyze deep learning algorithms: computation and memory complexity, computation patterns, etc.
Requirements
Deep understanding of implementation of DL algorithms
Good understanding of compute architecture in general
Good programming skills