-1

I am trying to parse a web document. Using HtmlAgilityPack (C#) That is exactly what looking for href value of a tag.

I'm parsing the http://www.ntis.go.kr/ThRndGateList.do

Although successful parse, the value is slightly different. I do not know why.

The actual value of the web is as follows:

enter image description here

The value obtained through the Htmlagilitypack are as follows:

enter image description here

As you can see, this strange starting with "jsessionid" value in the href value is obtained. What reason?

Thank you for regards.

Hyunjin-Kim
  • 129
  • 1
  • 2
  • 11

1 Answers1

0

It is probably because in your browser (in your case Chrome) you are logged in. If you make the request via HtmlAgilityPack you are like a freshly open browser:

  • Not logged in
  • Never on this page before

The Webapplication your trying to use generates a JSESSIONID when someone opens the page for the first time and this id is transfered via the URL.

This question could help you to understand the technology behind the webapplication: Under what conditions is a JSESSIONID created?

Community
  • 1
  • 1
Claudio P
  • 2,028
  • 3
  • 20
  • 41