Why is it difficult to conduct research comparing English-only programs and various types of bilingual education programs to determine which is the most effective? And why must we be careful generalizing the findings of a single study conducted in a school or district to other schools and districts across the state, nation or around the world?