Results (
Indonesian) 1: 
[Copy]Copied!
to 0.89 for various grade and socioeconomic groups. The investigators concluded thattheir test “appears to be a usable diagnostic instrument.” The major limitation theyidentified with their study was “the lack of external criteria for determining the validityof the test items, and the validity of the identified skills as a part of the problem-solvingprocess.” Still their research suggests fruitful avenues for others to pursue.Butts[ 161 developed an X-35 Test of Problem Solving (forms A and B) to assess thesebehaviors:( I ) Early formation of a hypothesis.(2) Specific experimentation with relevant variables as contrasted to random guessing.(3) Introduction of control to test the validity of a hypothesis selected.(4) Specific attempts at verification of the hypothesis.The testing situation placed the student in as natural a problem situation as possible inwhich the student could “select the kinds and amounts of information he believed wouldbest enable him to solve the problem.” Via a “tab” format, the student pulls the tab onall items that are considered helpful in solving a specific problem. The available datawere catagorized as:(i)(ii) Additional or extra information.(iii) Duplicate information.(iv) Irrelevant information.Once a “tab” has been pulled, it cannot be replaced, so the examiner has a record ofwhich items were used and in what sequence. The student responses were evaluated bythree professors based on these judgments:to the problem solution?variables to the problem?(C) Did the student introduce any controls into his thinking to test his hypothesis?On a scale of one to five, the judges evaluated each student’s responses with respect tothose four questions producing scores ranging from 4 to 20. A score of four indicates littleor no evidence of this problem-solving methodology in their thinking, while a score of20 indicates definite evidence of such structuring and thinking. Butts cited agreementbetween the evaluation of the investigator and the judges as evidence of construct validityof the test. Interpreting the two forms of the test as halves of one test, Butts correlatedstudent scores from both forms to obtain a reliability coefficient of 0.54. As an innovativeattempt in a very complex domain, this test should be scrutinized closely.Secondary School LevelThe Process of Science Test (POST) [ 171 formerly called the Impact Test, is composedof 40 four-choice items designed by the Biological Science Curriculum Study (BSCS)to measure the “ability of students to recognize adequate criteria for accepting or rejectinghypotheses, and to evaluate the general structure of experimental design in science, includingthe need for controls, repeatability, adequate sampling, and careful measurement.”The POST was to be one phase of the BSCS evaluation program. Items on thePOST are framed in biological science settings, but the authors claim knowledge ofbiology is not a prerequisite for scoring high on the test. Many of the items are based ontabular or graphical presentations of data or sketches of experimental setups. It wasdeveloped to be used both as a pre- and a posttest with biology or other classes in whichthe processes of science are important objectives. The test manual includes norms, reliabilityinformation, and correlations with measures of mental ability. The present form,copyrighted in 1963, was administered to more than 28,000 students at the beginningof the 1962- 1963 school year and to 24,000 at the end of the school year. Generally, thelast 20 items are of lower quality when compared with the excellent quality of the first20 items. A major weakness of the POST is that it lacks a precise table of specificationsor categorization of items as to specific processes of skills. Despite this weakness, thePOST is one of the few standardized tests in this area for secondary level students.More recently, Tannenbaum[ 181 developed an “instrument to assess achievementand diagnose weaknesses in the use of scientific processes by students in grades seven,eight, and nine” entitled the Test of Science Processes (TOSP). The test is based on theseprocesses: Observing, Comparing, Quantifying, Classifying, Measuring, Experimenting,Inferring, and Predicting. These processes were chosen after consulting the relevantSastra, sehingga ketergantungan pada SAPA model dimengerti. TOSP memiliki 96lima-pilihan item yang memerlukan 73 menit total pengujian waktu-biasanya didistribusikandalam dua sittings terpisah. Item pertama 12 Berdasarkan 35 mm warna slide dan besarmayoritas sisa barang disertai dengan gambar atau data. Di validasistudi, TOSP telah diberikan kepada lebih dari 3.600 siswa yang dipilih untuk mewakili semua kemampuantingkat dan rentang lebar SES latar belakang. Keandalan KR-20 tes total denganseluruh adalah 0.91. Keandalan perkiraan subtests (untuk masing-masing proses)bervariasi dari 0,30 0.78 dengan subtest c
Being translated, please wait..
