tag:blogger.com,1999:blog-8980734269859237005.post3394846478274184701..comments2024-01-30T01:03:21.768-08:00Comments on DevCurry: Parse HTML using the HTML Agility PackSuprotim Agarwalhttp://www.blogger.com/profile/08349831623922214390noreply@blogger.comBlogger6125tag:blogger.com,1999:blog-8980734269859237005.post-14652113610445108102011-02-07T19:04:41.728-08:002011-02-07T19:04:41.728-08:00Can you share the Regex that you used.Can you share the Regex that you used.Suprotim Agarwalhttps://www.blogger.com/profile/08349831623922214390noreply@blogger.comtag:blogger.com,1999:blog-8980734269859237005.post-49518736683923112082011-02-07T04:46:20.661-08:002011-02-07T04:46:20.661-08:00I tried using HAP in one of my projects but the pe...I tried using HAP in one of my projects but the performance slowed down dramatically but with regex it was it was a piece of cake... <br /><br />for the curious one I am trying to crawl approx 1m domains to extract some selected information from links found on homepage and other pages linked from homepageJoshhttp://www.regexhacks.com/blognoreply@blogger.comtag:blogger.com,1999:blog-8980734269859237005.post-27729820322200935302009-12-27T06:55:22.275-08:002009-12-27T06:55:22.275-08:00This comment has been removed by the author.Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-8980734269859237005.post-22529081439822320892009-12-27T01:37:01.121-08:002009-12-27T01:37:01.121-08:00You have to look on Data Extracting SDK (http://ex...You have to look on Data Extracting SDK (http://extracting.codeplex.com/)Anonymoushttps://www.blogger.com/profile/05872321685524349142noreply@blogger.comtag:blogger.com,1999:blog-8980734269859237005.post-29727671378176985712009-12-26T18:51:58.656-08:002009-12-26T18:51:58.656-08:00Regex is powerful, no doubt about that..but Regex ...Regex is powerful, no doubt about that..but Regex not a solution for parsing HTML.Suprotim Agarwalhttps://www.blogger.com/profile/08349831623922214390noreply@blogger.comtag:blogger.com,1999:blog-8980734269859237005.post-64229041789288385172009-12-26T11:46:43.764-08:002009-12-26T11:46:43.764-08:00Hap is not good i always use Regex where i need Ha...Hap is not good i always use Regex where i need Hap.Anonymousnoreply@blogger.com