HTML parser understand the following links:
<A HREF="xxx">
<A HREF="xxx" DATA-EXPANDED-URL="yyy" DATA-ULTIMATE-URL="zzz">
Attributes priority in link selection: data-ultimate-url, data-expanded-url, href.
<IMG SRC="xxx">
<LINK HREF="xxx">
<FRAME SRC="xxx">
<AREA HREF="xxx">
<BASE HREF="xxx">
Note: If BASE HREF value has incorrectly formed URL, current one will be used instead to compose relative links.
However, you can specify the list of HTML which would be omitted in new href lookup with SkipHrefIn command.
SkipHrefIn "img, link, script"
By default, DataparkSearch does not follow links with rel=nofollow attribute specified. But you can alter this behaviour with "DisableRelNoFollow yes" command. You need to put this command in your indexer.conf file.