[1] D. Freitag. Information Extraction from HTML: Application of a General Machine Learning Approach. In AAAI/IAAI 1998.
[2] Y. Freund, H. S. Seung, E. Shamir, and N. Tishby. Selective Sampling Using the Query by Committee Algorithm. In Machine Learning, 28 (2-3), pp. 133–168, 1997.
[3] M. Dubiner, Z. Galil, and E. Magen. Faster tree pattern matching. In J. Association for Computing Machinery, 41 (2), pp. 205–213, March 1994.
[4] R. Cole, R. Hariharan, and P. Indyk. Tree pattern matching and subset matching in deterministic O(n log^3 n)-time. In SODA: ACM-SIAM Symposium on Discrete Algorithms, 1999.
[5] C. M. Hoffmann and M. J. O'Donnell. Pattern matching in trees. In J. Association for Computing Machinery, 29 (1), pp. 68–95, January 1982.
[6] T. Cormen, C. Leiserson, R. Rivest, and C. Stein. Introduction to Algorithms. 2nd edition, 2001.
[7] K. Zhang and D. Shasha. Simple fast algorithms for the editing distance between trees and related problems. In SIAM J. Computing, 18 (6), pp. 1245–1262, December 1989.
[8] D. Reis, P. Golgher, A. Laender, and A. da Silva. Automatic web news extraction using tree edit distance. In WWW 2004.
[9] S. Soderland. Learning Information Extraction Rules for Semistructured and Free Text. In Machine Learning 1999.
[10] V. Vapnik. Statistical Learning Theory. New York, 1998.
[11] N. Kushmerick, D. S. Weld, and R. B. Doorenbos. Wrapper induction for information extraction. In Intl. Joint Conference on Artificial Intelligence (IJCAI), pp. 729–737, 1997.