Extracting Domain Name from URL with Perl
I often need to extract the domain name from a URL in the software I write, so I thought I would share my method. It is written in Perl.
The method consists of three consecutive regular-expression substitutions.
It discards any leading www. from the domain name, but it keeps any other third-level domain name (for example, the blog. in blog.example.com stays).
First, remove the http:// or https:// scheme and any leading www. from the front of the URL:
$url =~ s!^https?://(?:www\.)?!!i;
The above uses "!" as the delimiter for the substitution operator instead of "/" to avoid having to escape the embedded "/" characters. The "i" at the end makes the match case-insensitive.
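As a small illustration of this first step, here is a sketch that applies the same substitution to a few sample URLs. The subroutine name strip_scheme_and_www and the sample URLs are made up for this example; they are not part of the method itself.

```perl
use strict;
use warnings;

# Hypothetical helper: applies only the first substitution to a copy
# of the URL, so the caller's string is left unchanged.
sub strip_scheme_and_www {
    my ($url) = @_;
    $url =~ s!^https?://(?:www\.)?!!i;   # drop scheme and a leading www.
    return $url;
}

# Illustrative sample URLs (assumed, not from the article).
for my $url (
    'http://www.example.com/page',
    'HTTPS://WWW.Example.com',
    'https://blog.example.com/',
) {
    print "$url => ", strip_scheme_and_www($url), "\n";
}
```

Because of the "i" flag, the upper-case HTTPS://WWW. prefix is removed as well, leaving Example.com. The blog. prefix in the last URL is not touched.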
Then, strip off everything from the first "/" to the end of the URL (doing nothing if there is no "/"):
$url =~ s!/.*!!;
Last, in case the URL was http://example.com?stuff or http://example.com#stuff or http://example.com:80/whatever, strip off everything from the first "?", "#", or ":" onward, if present:
$url =~ s/[\?\#\:].*//;
The value of $url is now the domain name by itself.
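The three steps can be wrapped in one subroutine for reuse. The following is a minimal sketch; the name domain_from_url and the test URLs are assumptions chosen for illustration.

```perl
use strict;
use warnings;

# Hypothetical helper name; the three substitutions are the ones
# described above, applied in the same order to a copy of the URL.
sub domain_from_url {
    my ($url) = @_;
    $url =~ s!^https?://(?:www\.)?!!i;   # 1. drop scheme and leading www.
    $url =~ s!/.*!!;                     # 2. drop everything from the first "/"
    $url =~ s/[\?\#\:].*//;              # 3. drop query, fragment, or port
    return $url;
}

# Illustrative sample URLs.
for my $url (
    'http://www.example.com/path/page.html',
    'https://blog.example.com?stuff',
    'http://example.com#stuff',
    'http://example.com:80/whatever',
) {
    printf "%-40s => %s\n", $url, domain_from_url($url);
}
```

Each of these sample URLs reduces to example.com, except the second, which keeps its third-level name and yields blog.example.com. Because the subroutine works on a copy, the original string passed in is not modified.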
Will Bontrager