집

멀티모달 모델

InternVL3-2B VS VILA1.5-13B

모델 이름	플랫폼	출시 시간	모델 매개 변수 수량	포괄적 인 점수
InternVL3-2B	Shanghai AI Laboratory & Tsinghua University	2025년 6월 1일	2.09B	3.0
VILA1.5-13B	NVIDIA	2025년 3월 1일	13B	2.4

왼쪽과 오른쪽으로 스와이프하여 더 보기

InternVL3-2B vs VILA1.5-13B ai 모델의 간단한 비교

종합 평가

두 모델 모두 다중모달 추론에서 낮은 성능을 보이며, 시각 세부 정보의 심각한 오해와 비논리적 추론이 존재하여 전체적인 능력 수준이 낮음을 나타냅니다。

멀티모달 추론

Both InternVL3-2B and VILA1.5-13B are weak in multimodal reasoning, exhibiting severe misinterpretation of visual information and shallow, chaotic cross-modal reasoning, with capabilities at a low level.

멀티모달 창작

InternVL3-2B과 VILA1.5-13B 모두 다중모달 창작에서 약하며 시각과 언어 간 심각한 단절, 얕고 혼란스러운 창의성을 보여 능력 수준이 낮습니다。