What happens when reviewers find major flaws during artifact evaluation? Can artifact evaluation impact paper acceptance?