Please allow me to show my appreciation to your brilliant work?
But I have a question:When you directly test ScanQA and SQA3D, how do you extract the key objects? The original problems of these two datasets seem unlikely to extract using the rule-based approach.