D. Freitag. Information Extraction from HTML: Application of a General Machine Learning Approach. In AAAI/IAAI 1998.
 Yoav Freund, H. Sebastian Seung, Eli Shamir, and Naftali Tishby. Selective Sampling Using the Query by Committee Algorithm. In Machine Learning v. 28; 2-3, pp. 133-168, 1997.
 M. Dubiner, Z. Galil, and E. Magin. Faster tree pattern matching. In J. Association of computing Machinery, 41 (2), pp. 205–213, March 1987.
 R. Cole, R. Hariharan and P. Indyk. Tree pattern matching and subset matching in deterministic O(nlog3n)-time. In In SODA: ACM-SIAM Symposium on Discrete Algorithms (A conference on Theoretical and Experimental Analysis of Discrete Algorithms) 1999.
 C. M. Hoffmann, M. J. O'Donnell. Pattern matching in trees. In J. Association of computing Machinery, 29 (1)), pp. 68-95, January 1982.
 T. Cormen, C. Lieserson, R. Rivest, and C Stein. Introduction to Algorithms. 2nd edition, 2001.
 K. Zhang, D. Shasha. Simple fast algorithms for the editing distance between trees and related problems. In SIAM J. Computing, 18 (6), pp. 1245–1262, December 1989.
 D. Reis, P. Golgher, A. Laender, A. da Silva. Automatic web news extraction using tree edit distance.. In WWW 2004.
 S. Soderland. Learning Information Extraction Rules for Semistructured and Free Text. In Machine Learning 1999.
 V. Vapnik. Statistical Learning Theory. New York, 1998.
 N. Kushmerick, D. S. Weld, and R. B. Doorenhos. Wrapper induction for information extraction. In In Intl. Joint Conference on Artificial Intelligence (IJCAI) pp. 729–737, 1997.
Computer Science and Artificial Intelligence Laboratory (CSAIL)
The Stata Center, Building 32 - 32 Vassar Street - Cambridge, MA 02139 - USA
tel:+1-617-253-0073 - firstname.lastname@example.org