Understanding URL - A Short Tutorial
Every document on the Web has a unique address. This address is known as Uniform Resource Locator (URL).
Several HTML/XHTML tags include a URL attribute value, including hyperlinks, inline images, and forms. All use the same syntax to specify the location of a web resource, regardless of the type or content of that resource. That's why it is known a Uniform Resource Locator.
A URL is made of up several parts, each of which offers information to theWeb browser to help find the page you are after. It is easier to learn the parts of a URL if you look at the most common ones first. If you look at the example URL given below, there are three key parts: the scheme, the host address, and the file path. The following sections discuss each of these in turn.
The scheme identifies the type of protocol and URL you are linking to and therefore how the resource should be retrieved. For example, most Web pages use something called the Hypertext Transfer Protocol (HTTP) to pass information to you which is why most Web pages start with http://.
There are other schemes available and you can use based on your requirement:
|http://||Hypertext Transfer Protocol (HTTP) is used to request pages from Web servers and send them back from Web servers to browsers.|
|https://||Secure Hypertext Transfer Protocol (HTTPS) encrypts the data sent between the browser and the Web server using a digital certificate.|
|ftp://||File Transfer Protocol is another method for transferring files on the Web. While HTTP is a lot more popular for viewing Web sites because of its integration with browsers, FTP is still commonly used protocol to transfer large files across the Web and to upload source files to your Web server.|
|file://||Used to indicate that a file is on the local hard disk or a shared directory on a LAN.|
The Host Address:
The host address is where a Web site can be found, either the IP address (four sets of numbers between 0 and 258, for example 126.96.36.199 ) or more commonly the domain name for a site such as www.tutorialspoint.com. Note that "www" is not actually part of the domain name although it is often used in the host address.it has nothing to do with the protocol used.
The filepath always begins with a forward slash character, and may consist of one or more directory or folder names. Each directory name is separated by forward slash characters and the filepath may end with a filename at the end. Here index.htm is the filename which is available in html directory:
Other Parts of the URL:
Credentials are a way of specifying a username and password for a password-protected part of a site. The credentials come before the host address, and are separated from the host address by an @ sign. Note how the username is separated from the password by a colon. The following URL shows the username admin and the password admin123:
Using the above URL, you can authenticate administrator and if provided ID and Password are correct then administrator will have access on index.htm file available in admin directory.
You can use a telnet URL to connect to a server as follows :
Another important information is Web Server Port Number. By default HTTP Server runs on port number 80. But if you are running a server on any other port number then it can be porvided as follows, assuming server is running on port 8080:
Fragment identifiers can be used after a filename to indicate a specific part of the page that a browser should go to immediately. Following is the example to reach to the top of page html_text_links.htm.
You can pass some information to the server using URL. When you use a form on aWeb page, such as a search form or an online order form, the browser can append the information you supply to the URL to pass information from you to the server as follows:
Here, the path arguments searchTerm=HTML are added to the URL to indicates the user is searching on the term HTML:
Absolute and Relative URLs:
You may address a URL in one of two ways:
Absolute - An absolute URL is the complete address of a resource everything your system needs to find a document and its server on the Web. For example http://www.tutorialspoint.com/html/html_text_links.htm
Relative: A relative URL indicates where the resource is in relation to the current page. Given URL is added with the <base> element to form a complete URL. For example /html/html_text_links.htm
Reserved and Unsafe Characters
Reserved characters are those that have a specific meaning within the URL itself. For example, the slash character separates elements of a pathname within a URL. If you need to include a slash in a URL that is not intended to be an element separator then you need to encode it as %2F:
Unsafe characters are those that have no special meaning within the URL but may have a special meaning in the context the URL is written. For example, double quotes ("") delimit URL attribute values in tags. If you were to include a double quotation mark directly in a URL, you would probably confuse the browser. Instead, you should encode the double quotation mark as avoid any possible conflict.
Please check HTML URL Encoding tutorial to understand this encoding and Reserved and Unsafe Characters.