回 Jason 主站·Embodied AI Reading Station
没主意?快捷入口
Heatmap · 标签共现矩阵

21 × 21 共现强度。

格子越深,两个 tag 共同出现的论文越多。对角线 = 该 tag 自身论文数。看这页能发现"谁经常和谁一起出现"——比如 transformer × VLA 高,RF × tactile 几乎为零。

visiontransformerlanguagediffusionimitationRLdatasetmanipulationVLA3DVLMaudio-speechnavigationsim2realRF-radarworld-modelflow-matchingmamba-ssmlocomotiontactileopen-source visiontransformerlanguagediffusionimitationRLdatasetmanipulationVLA3DVLMaudio-speechnavigationsim2realRF-radarworld-modelflow-matchingmamba-ssmlocomotiontactileopen-source vision: 92 92 vision × transformer: 55 55 vision × language: 46 46 vision × diffusion: 38 38 vision × imitation: 25 25 vision × RL: 14 14 vision × dataset: 26 26 vision × manipulation: 22 22 vision × VLA: 17 17 vision × 3D: 17 17 vision × VLM: 23 23 vision × audio-speech: 10 10 vision × navigation: 9 9 vision × sim2real: 9 9 vision × RF-radar: 10 10 vision × world-model: 4 4 vision × flow-matching: 9 9 vision × mamba-ssm: 4 4 vision × locomotion: 1 vision × tactile: 5 5 vision × open-source: 2 transformer × vision: 55 55 transformer: 78 78 transformer × language: 47 47 transformer × diffusion: 42 42 transformer × imitation: 24 24 transformer × RL: 18 18 transformer × dataset: 15 15 transformer × manipulation: 13 13 transformer × VLA: 19 19 transformer × 3D: 10 10 transformer × VLM: 17 17 transformer × audio-speech: 13 13 transformer × navigation: 6 6 transformer × sim2real: 4 4 transformer × RF-radar: 5 5 transformer × world-model: 8 8 transformer × flow-matching: 10 10 transformer × mamba-ssm: 5 5 transformer × locomotion: 2 transformer × tactile: 4 4 transformer × open-source: 0 language × vision: 46 46 language × transformer: 47 47 language: 73 73 language × diffusion: 25 25 language × imitation: 17 17 language × RL: 18 18 language × dataset: 19 19 language × manipulation: 11 11 language × VLA: 22 22 language × 3D: 6 6 language × VLM: 21 21 language × audio-speech: 7 7 language × navigation: 2 language × sim2real: 4 4 language × RF-radar: 1 language × world-model: 6 6 language × flow-matching: 8 8 language × mamba-ssm: 4 4 language × locomotion: 2 language × tactile: 2 language × open-source: 2 diffusion × vision: 38 38 diffusion × transformer: 42 42 diffusion × language: 25 25 diffusion: 58 58 diffusion × imitation: 30 30 diffusion × RL: 18 18 diffusion × dataset: 9 9 diffusion × manipulation: 22 22 diffusion × VLA: 18 18 diffusion × 3D: 13 13 diffusion × VLM: 6 6 diffusion × audio-speech: 3 3 diffusion × navigation: 6 6 diffusion × sim2real: 7 7 diffusion × RF-radar: 3 3 diffusion × world-model: 8 8 diffusion × flow-matching: 13 13 diffusion × mamba-ssm: 2 diffusion × locomotion: 2 diffusion × tactile: 1 diffusion × open-source: 0 imitation × vision: 25 25 imitation × transformer: 24 24 imitation × language: 17 17 imitation × diffusion: 30 30 imitation: 48 48 imitation × RL: 19 19 imitation × dataset: 7 7 imitation × manipulation: 20 20 imitation × VLA: 11 11 imitation × 3D: 8 8 imitation × VLM: 3 3 imitation × audio-speech: 1 imitation × navigation: 4 4 imitation × sim2real: 10 10 imitation × RF-radar: 1 imitation × world-model: 4 4 imitation × flow-matching: 7 7 imitation × mamba-ssm: 2 imitation × locomotion: 1 imitation × tactile: 1 imitation × open-source: 0 RL × vision: 14 14 RL × transformer: 18 18 RL × language: 18 18 RL × diffusion: 18 18 RL × imitation: 19 19 RL: 41 41 RL × dataset: 8 8 RL × manipulation: 13 13 RL × VLA: 8 8 RL × 3D: 9 9 RL × VLM: 3 3 RL × audio-speech: 0 RL × navigation: 4 4 RL × sim2real: 8 8 RL × RF-radar: 0 RL × world-model: 12 12 RL × flow-matching: 0 RL × mamba-ssm: 6 6 RL × locomotion: 4 4 RL × tactile: 0 RL × open-source: 0 dataset × vision: 26 26 dataset × transformer: 15 15 dataset × language: 19 19 dataset × diffusion: 9 9 dataset × imitation: 7 7 dataset × RL: 8 8 dataset: 40 40 dataset × manipulation: 11 11 dataset × VLA: 5 5 dataset × 3D: 7 7 dataset × VLM: 12 12 dataset × audio-speech: 6 6 dataset × navigation: 3 3 dataset × sim2real: 6 6 dataset × RF-radar: 4 4 dataset × world-model: 2 dataset × flow-matching: 1 dataset × mamba-ssm: 1 dataset × locomotion: 2 dataset × tactile: 2 dataset × open-source: 1 manipulation × vision: 22 22 manipulation × transformer: 13 13 manipulation × language: 11 11 manipulation × diffusion: 22 22 manipulation × imitation: 20 20 manipulation × RL: 13 13 manipulation × dataset: 11 11 manipulation: 35 35 manipulation × VLA: 6 6 manipulation × 3D: 9 9 manipulation × VLM: 2 manipulation × audio-speech: 0 manipulation × navigation: 5 5 manipulation × sim2real: 9 9 manipulation × RF-radar: 0 manipulation × world-model: 0 manipulation × flow-matching: 6 6 manipulation × mamba-ssm: 0 manipulation × locomotion: 3 3 manipulation × tactile: 1 manipulation × open-source: 0 VLA × vision: 17 17 VLA × transformer: 19 19 VLA × language: 22 22 VLA × diffusion: 18 18 VLA × imitation: 11 11 VLA × RL: 8 8 VLA × dataset: 5 5 VLA × manipulation: 6 6 VLA: 33 33 VLA × 3D: 5 5 VLA × VLM: 9 9 VLA × audio-speech: 1 VLA × navigation: 1 VLA × sim2real: 6 6 VLA × RF-radar: 0 VLA × world-model: 4 4 VLA × flow-matching: 6 6 VLA × mamba-ssm: 3 3 VLA × locomotion: 0 VLA × tactile: 2 VLA × open-source: 1 3D × vision: 17 17 3D × transformer: 10 10 3D × language: 6 6 3D × diffusion: 13 13 3D × imitation: 8 8 3D × RL: 9 9 3D × dataset: 7 7 3D × manipulation: 9 9 3D × VLA: 5 5 3D: 28 28 3D × VLM: 2 3D × audio-speech: 2 3D × navigation: 2 3D × sim2real: 4 4 3D × RF-radar: 8 8 3D × world-model: 0 3D × flow-matching: 0 3D × mamba-ssm: 0 3D × locomotion: 1 3D × tactile: 1 3D × open-source: 0 VLM × vision: 23 23 VLM × transformer: 17 17 VLM × language: 21 21 VLM × diffusion: 6 6 VLM × imitation: 3 3 VLM × RL: 3 3 VLM × dataset: 12 12 VLM × manipulation: 2 VLM × VLA: 9 9 VLM × 3D: 2 VLM: 27 27 VLM × audio-speech: 0 VLM × navigation: 0 VLM × sim2real: 1 VLM × RF-radar: 0 VLM × world-model: 1 VLM × flow-matching: 2 VLM × mamba-ssm: 1 VLM × locomotion: 0 VLM × tactile: 0 VLM × open-source: 1 audio-speech × vision: 10 10 audio-speech × transformer: 13 13 audio-speech × language: 7 7 audio-speech × diffusion: 3 3 audio-speech × imitation: 1 audio-speech × RL: 0 audio-speech × dataset: 6 6 audio-speech × manipulation: 0 audio-speech × VLA: 1 audio-speech × 3D: 2 audio-speech × VLM: 0 audio-speech: 22 22 audio-speech × navigation: 4 4 audio-speech × sim2real: 0 audio-speech × RF-radar: 3 3 audio-speech × world-model: 0 audio-speech × flow-matching: 0 audio-speech × mamba-ssm: 0 audio-speech × locomotion: 0 audio-speech × tactile: 1 audio-speech × open-source: 0 navigation × vision: 9 9 navigation × transformer: 6 6 navigation × language: 2 navigation × diffusion: 6 6 navigation × imitation: 4 4 navigation × RL: 4 4 navigation × dataset: 3 3 navigation × manipulation: 5 5 navigation × VLA: 1 navigation × 3D: 2 navigation × VLM: 0 navigation × audio-speech: 4 4 navigation: 18 18 navigation × sim2real: 3 3 navigation × RF-radar: 7 7 navigation × world-model: 1 navigation × flow-matching: 1 navigation × mamba-ssm: 0 navigation × locomotion: 2 navigation × tactile: 1 navigation × open-source: 0 sim2real × vision: 9 9 sim2real × transformer: 4 4 sim2real × language: 4 4 sim2real × diffusion: 7 7 sim2real × imitation: 10 10 sim2real × RL: 8 8 sim2real × dataset: 6 6 sim2real × manipulation: 9 9 sim2real × VLA: 6 6 sim2real × 3D: 4 4 sim2real × VLM: 1 sim2real × audio-speech: 0 sim2real × navigation: 3 3 sim2real: 18 18 sim2real × RF-radar: 0 sim2real × world-model: 2 sim2real × flow-matching: 0 sim2real × mamba-ssm: 1 sim2real × locomotion: 4 4 sim2real × tactile: 1 sim2real × open-source: 0 RF-radar × vision: 10 10 RF-radar × transformer: 5 5 RF-radar × language: 1 RF-radar × diffusion: 3 3 RF-radar × imitation: 1 RF-radar × RL: 0 RF-radar × dataset: 4 4 RF-radar × manipulation: 0 RF-radar × VLA: 0 RF-radar × 3D: 8 8 RF-radar × VLM: 0 RF-radar × audio-speech: 3 3 RF-radar × navigation: 7 7 RF-radar × sim2real: 0 RF-radar: 17 17 RF-radar × world-model: 0 RF-radar × flow-matching: 1 RF-radar × mamba-ssm: 0 RF-radar × locomotion: 0 RF-radar × tactile: 0 RF-radar × open-source: 0 world-model × vision: 4 4 world-model × transformer: 8 8 world-model × language: 6 6 world-model × diffusion: 8 8 world-model × imitation: 4 4 world-model × RL: 12 12 world-model × dataset: 2 world-model × manipulation: 0 world-model × VLA: 4 4 world-model × 3D: 0 world-model × VLM: 1 world-model × audio-speech: 0 world-model × navigation: 1 world-model × sim2real: 2 world-model × RF-radar: 0 world-model: 15 15 world-model × flow-matching: 0 world-model × mamba-ssm: 6 6 world-model × locomotion: 1 world-model × tactile: 0 world-model × open-source: 0 flow-matching × vision: 9 9 flow-matching × transformer: 10 10 flow-matching × language: 8 8 flow-matching × diffusion: 13 13 flow-matching × imitation: 7 7 flow-matching × RL: 0 flow-matching × dataset: 1 flow-matching × manipulation: 6 6 flow-matching × VLA: 6 6 flow-matching × 3D: 0 flow-matching × VLM: 2 flow-matching × audio-speech: 0 flow-matching × navigation: 1 flow-matching × sim2real: 0 flow-matching × RF-radar: 1 flow-matching × world-model: 0 flow-matching: 13 13 flow-matching × mamba-ssm: 0 flow-matching × locomotion: 0 flow-matching × tactile: 0 flow-matching × open-source: 0 mamba-ssm × vision: 4 4 mamba-ssm × transformer: 5 5 mamba-ssm × language: 4 4 mamba-ssm × diffusion: 2 mamba-ssm × imitation: 2 mamba-ssm × RL: 6 6 mamba-ssm × dataset: 1 mamba-ssm × manipulation: 0 mamba-ssm × VLA: 3 3 mamba-ssm × 3D: 0 mamba-ssm × VLM: 1 mamba-ssm × audio-speech: 0 mamba-ssm × navigation: 0 mamba-ssm × sim2real: 1 mamba-ssm × RF-radar: 0 mamba-ssm × world-model: 6 6 mamba-ssm × flow-matching: 0 mamba-ssm: 8 8 mamba-ssm × locomotion: 0 mamba-ssm × tactile: 0 mamba-ssm × open-source: 0 locomotion × vision: 1 locomotion × transformer: 2 locomotion × language: 2 locomotion × diffusion: 2 locomotion × imitation: 1 locomotion × RL: 4 4 locomotion × dataset: 2 locomotion × manipulation: 3 3 locomotion × VLA: 0 locomotion × 3D: 1 locomotion × VLM: 0 locomotion × audio-speech: 0 locomotion × navigation: 2 locomotion × sim2real: 4 4 locomotion × RF-radar: 0 locomotion × world-model: 1 locomotion × flow-matching: 0 locomotion × mamba-ssm: 0 locomotion: 6 6 locomotion × tactile: 0 locomotion × open-source: 0 tactile × vision: 5 5 tactile × transformer: 4 4 tactile × language: 2 tactile × diffusion: 1 tactile × imitation: 1 tactile × RL: 0 tactile × dataset: 2 tactile × manipulation: 1 tactile × VLA: 2 tactile × 3D: 1 tactile × VLM: 0 tactile × audio-speech: 1 tactile × navigation: 1 tactile × sim2real: 1 tactile × RF-radar: 0 tactile × world-model: 0 tactile × flow-matching: 0 tactile × mamba-ssm: 0 tactile × locomotion: 0 tactile: 5 5 tactile × open-source: 0 open-source × vision: 2 open-source × transformer: 0 open-source × language: 2 open-source × diffusion: 0 open-source × imitation: 0 open-source × RL: 0 open-source × dataset: 1 open-source × manipulation: 0 open-source × VLA: 1 open-source × 3D: 0 open-source × VLM: 1 open-source × audio-speech: 0 open-source × navigation: 0 open-source × sim2real: 0 open-source × RF-radar: 0 open-source × world-model: 0 open-source × flow-matching: 0 open-source × mamba-ssm: 0 open-source × locomotion: 0 open-source × tactile: 0 open-source: 2

★ 对角线 mustard = 该 tag 论文数 / 非对角 coral = 共现数 / 数字 ≥3 才显示