COLLECTED BY
Organization:
Internet Archive
The Internet Archive discovers and captures web pages through many different web crawls.
At any given time several distinct crawls are running, some for months, and some every day or longer.
View the web archive through the
Wayback Machine.
The seed for Wide00014 was:
- Slash pages from every domain on the web:
-- a list of domains using Survey crawl seeds
-- a list of domains using Wide00012 web graph
-- a list of domains using Wide00013 web graph
- Top ranked pages (up to a max of 100) from every linked-to domain using the Wide00012 inter-domain navigational link graph
-- a ranking of all URLs that have more than one incoming inter-domain link (rank was determined by number of incoming links using Wide00012 inter domain links)
-- up to a maximum of 100 most highly ranked URLs per domain
The seed list contains a total of 431,055,452 URLsThe seed list was further filtered to exclude known porn, and link farm, domainsThe modified seed list contains a total of 428M URLs
The Wayback Machine - https://web.archive.org/web/20160912005250/http://www.cambridgesoft.com/services/documentation/sdk/chemdraw/cdx/properties/Atom_RestrictSubstituentsUpTo.htm
Atom_RestrictSubstituentsUpTo Property
CDXML Name: | SubstituentsUpTo |
CDX Constant Name: | kCDXProp_Atom_RestrictSubstituentsUpTo |
CDX Constant Value: | 0x0435 |
Data Size: | UINT8 |
Property of objects: | kCDXObj_Node |
| |
First written/read in: | ChemDraw 4.0 |
Required? | No |
Description:
Indicates that substitution is restricted to no more than the specified value.
A substituent is defined as some other non-hydrogen node bonded to this one. It is strictly a count of attached bonds (to non-hydrogen atoms), and not a sum of bond orders. This definition of "substituent" exactly matches the definition used by ISIS.
Note that it is possible to assign impossible values, for example by saying that one of the atoms in benzene has no more than one substituent (when it already has two, by nature of being part of the benzene ring). Impossibilities of that sort are not forbidden by this specification, but it is assumed that such a structure would match nothing else if posed as a structure query in some database.
If this property is absent:
The node is treated as unrestricted in terms of substitution.
CDX Documentation index