Class BRobotsTxt
java.lang.Object
|
+----java.util.Vector
|
+----BRobotsTxt
- public class BRobotsTxt
- extends Vector
This class uses a weak implementation of the Robot Exclusion Standard
as it is used commonly throughout the web. There is an Internet-Draft
for a more rigid version. Since this is yet not a standard and the author
believes that the existing convention offers good oppotunity to protect
websites from being indexed unwillingly, the following implementation
fits the requirements ;-)
For more information on this topic vistit http://info.webcrawler.com/mak/projects/robots/.
- Version:
- 0.1
- Author:
- Simon Berg
-
BRobotsTxt(URL, String)
- An object of this class represents the robot-protection information of one internet site.
-
isAllowed(URL)
- Returns true if robot-access of url is allowed, false otherwise
BRobotsTxt
public BRobotsTxt(URL url,
String robotName) throws NoRobotInformationException
- An object of this class represents the robot-protection information of one internet site.
- Parameters:
- url - can be a site name with directory name and file name.
- robotName - will be processed case-insensitive
- Throws: NoRobotInformationException
- if no information is found.
isAllowed
public boolean isAllowed(URL url)
- Returns true if robot-access of url is allowed, false otherwise