Wednesday, September 14, 2016

Feature-based face detection with the Viola-Jones algorithm


What you see above is how the Viola-Jones algorithm detected my face! This result can be obtained with a couple of lines of code in MATLAB, which is not a difficult task with all the powerful features it provides.

In general, the problem addressed in face detection is to classify a given image depending on whether it contains a human face or not. Feature-based face detection is one intuitive approach to solving it. The general concept behind this approach is to identify certain interesting features of a given image and, depending on the features identified, decide whether the image contains a human face or not. The features can be actual physical features of a face like eyes, nose and mouth, or any other interesting points when viewed from an image processing perspective.

Not all feature-based face detection algorithms utilize machine learning techniques to solve the problem. Some approaches use image processing techniques to identify interesting features and make decisions based upon them. Out of the machine learning approaches used in feature-based face detection, the boosted-cascade detector proposed by Paul Viola and Michael Jones (commonly known as the Viola-Jones algorithm), which selects its features using the AdaBoost learning algorithm, is the most popular one.

The specific features used in the Viola-Jones algorithm are a set of rectangular patterns called Haar-like features. Some examples of such Haar features are shown in the following figure.



These feature masks are laid over the image in every possible position and scale, and a value representing how much that particular region of the image matches the feature mask is calculated. The higher the value, the more strongly the region contains that particular feature. Viola and Jones also suggested a very efficient way of performing this process programmatically by creating an intermediate representation of the image known as the 'Integral image'.
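
To make the integral image idea concrete, here is a minimal sketch in JavaScript (purely illustrative; the original work and libraries like MATLAB implement this far more efficiently). ii[y][x] holds the sum of all pixels above and to the left of (x, y) inclusive, so the sum under any rectangle takes only 4 lookups:

function integralImage(img) {
    var rows = img.length, cols = img[0].length;
    var ii = [];
    for (var y = 0; y < rows; y++) {
        ii.push(new Array(cols).fill(0));
        for (var x = 0; x < cols; x++) {
            ii[y][x] = img[y][x]
                + (x > 0 ? ii[y][x - 1] : 0)               // sum to the left
                + (y > 0 ? ii[y - 1][x] : 0)               // sum above
                - (x > 0 && y > 0 ? ii[y - 1][x - 1] : 0); // counted twice
        }
    }
    return ii;
}

// Sum of the rectangle with corners (x1, y1) and (x2, y2), inclusive
function rectSum(ii, x1, y1, x2, y2) {
    var a = (x1 > 0 && y1 > 0) ? ii[y1 - 1][x1 - 1] : 0;
    var b = (y1 > 0) ? ii[y1 - 1][x2] : 0;
    var c = (x1 > 0) ? ii[y2][x1 - 1] : 0;
    return ii[y2][x2] - b - c + a;
}

A two-rectangle Haar feature value is then simply the difference of two rectSum calls (the sum under the white area minus the sum under the black area), no matter how large the feature is.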

Once we have a feature set and a set of positive and negative training images, the next step is to find the most important set of features that contribute to accurately classifying the images. This is where machine learning techniques are used. AdaBoost is one such technique that can be used for selecting the features and training the classifier.

In simple terms, consider having n different features and a set of m positive and negative training images. Each feature is evaluated on each of the images to obtain a value, and a weight is associated with each training image; initially all those weights are equal. Then, for a given feature, a threshold is determined by considering the distribution of the feature evaluations over the training images and their weights. The images producing a value greater than that threshold for that feature are classified as having a face, and vice versa. Depending on the threshold chosen, there will almost always be some misclassified images as well. The weights of all such misclassified images are summed up, and that sum is used as the error measure for that particular feature. This process is carried out for all the features, and the feature that yields the minimum error measure is chosen to be included in the final classifier. Then, before moving on to the next iteration, the weights of the misclassified images are increased so that more attention is paid to getting them classified correctly during the threshold determination in the next iteration. This is the machine learning process followed in the AdaBoost learning algorithm.
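
To make that description concrete, here is a much-simplified sketch of one boosting round in JavaScript. evaluateFeature is a hypothetical helper that returns the Haar feature value of an image, and real implementations also try both polarities of the threshold, which is omitted here:

// labels[i] is 1 for face images and 0 otherwise; weights sum to 1
function selectBestFeature(features, images, labels, weights) {
    var best = { error: Infinity };
    features.forEach(function (feature) {
        var values = images.map(function (img) {
            return evaluateFeature(feature, img); // hypothetical helper
        });
        // Try each observed value as a candidate threshold
        values.forEach(function (threshold) {
            var error = 0;
            for (var i = 0; i < images.length; i++) {
                var predictedFace = values[i] > threshold ? 1 : 0;
                if (predictedFace !== labels[i]) {
                    error += weights[i]; // weighted error measure
                }
            }
            if (error < best.error) {
                best = { feature: feature, threshold: threshold, error: error };
            }
        });
    });
    return best;
}

After each round, the weights of the images the selected feature misclassified are increased (and all the weights re-normalized) before the next round begins.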

Once the most important features have been selected, the algorithm assigns a weight to each of them, and a linear combination of them gives a strong classifier function that can be used to classify a given image, determining whether it contains a human face or not.
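
As a sketch, the final decision then looks something like this, where selected holds the chosen features with their thresholds and the alpha weights AdaBoost assigned to them (again using the hypothetical evaluateFeature helper from above):

function isFace(selected, image) {
    var score = 0, totalAlpha = 0;
    selected.forEach(function (weak) {
        totalAlpha += weak.alpha;
        if (evaluateFeature(weak.feature, image) > weak.threshold) {
            score += weak.alpha; // this weak classifier votes 'face'
        }
    });
    // Classified as a face when the weighted vote exceeds half the total weight
    return score >= totalAlpha / 2;
}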


Monday, September 5, 2016

Every Story Has An Ending. Every Ending Is A New Beginning.


Yes. It is a universal truth that everything must come to an end. And so did Google Summer of Code 2016. The program ended on the 30th of August 2016, marking the end of another successful Google Summer of Code for Google as well as for Joomla!
As a Joomla! GSoC student, I worked on the JavaScript testing project, which introduced JavaScript testing for the JavaScript libraries in Joomla!. Now that the program is over, it is fair to say that my mentors Ashan Fernando and Yves Hoppe and I achieved more than what we had in mind in the beginning. A stable testing environment was set up with Jasmine and Karma. The tests were also configured to run on Travis. Where the existing code was not testable, we refactored it and improved its testability. Above all, a comprehensive test suite was written to cover almost all the existing JavaScript libraries, along with detailed, easy to follow documentation.
Testing is always a requirement in any kind of large software system. The final outcome of the project is mainly going to save developers from a lot of unnecessary hassle. Of course, the benefits are not limited to that. I have explained the importance of this project to Joomla! in detail in one of my previous articles, which you can read here.
Considering what we have achieved so far as a team, I feel really proud and satisfied. It was not just coding I did during the program; it was much more than that. The whole experience was amazing: getting to know new people and making new friends. And with respect to the work done, the feedback I have received from my mentors and the community always makes me feel I accomplished something great.
Most people tend to see GSoC as just a coding program. That is really far from the truth. If you are lucky enough to be with an organization like Joomla!, you will definitely learn a lot more than that. It is true that the project was technical and I got the chance to learn testing concepts and gain experience using technologies like Jasmine and Karma inside and out. But my mentors also gave me advice on how to market things and get people interested in listening to what I have to say. We had to write articles regularly during the program, and each of them taught me something about writing better.
From the very beginning of the program, I felt really glad I got the chance to become a part of this awesome community. Our GSoC program admins did an amazing job at making us feel welcomed and motivated throughout the program. The perception I had earlier of open source contribution was that it is something dull and boring, but the experience I got with Joomla! was quite the opposite. Everything about it was interesting and exciting, and I look forward to keep helping Joomla! and being a part of it even though the GSoC program has ended.


For those of you who are interested in knowing more about the work I did, you can check out the PRs I made to the Joomla core repository here. All the PRs related to the main scope of the project have already been merged and are being used live in the core. It makes us really happy to see other Joomla! developers also writing new tests for the new JavaScript libraries and functionality they develop on top of the testing setup we introduced.
Furthermore, all the previous articles I wrote regarding this project can be accessed from here. And for those of you who would like to write new tests or try out the work we have done, everything you need to know is in the documentation I wrote, which is accessible here.
Many thanks go to everybody in the Joomla! community who helped make it all happen. Thank you!

Sunday, September 4, 2016

Why Joomla! Needs JavaScript Testing


Joomla! CMS is quite famous as a system written in PHP. But there are certain things that PHP alone cannot address. Being a server-side scripting language, PHP has no authority over what happens on the client side. PHP is therefore helpless for even very simple things that involve changing the web page content depending on the user's input, without a round trip to the server.
Yet this functionality lays the foundation for building web applications that provide a dynamic and interactive experience for the user. This is where client-side scripting languages come into play. JavaScript has become the most popular client-side language, and it is quite hard to find a web based system that does not use it.
Joomla! is a large scale Content Management System, and around 7% of its code base is written in JavaScript. JavaScript handles a variety of tasks which add valuable functionality to the Joomla! CMS. With time that share is only going to grow, as techniques like AJAX open up numerous possibilities for improving Joomla! further.
For instance, there is a custom JavaScript library called JCaption in Joomla! which has been written to make the lives of developers easier. Whenever there is an image that needs a caption, this library takes the image's title attribute and dynamically adds the text below it.
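
Roughly, what such a caption helper does can be sketched as follows (a simplified illustration, not Joomla's actual implementation; the img.caption selector is an assumption):

var images = document.querySelectorAll('img.caption');
for (var i = 0; i < images.length; i++) {
    var img = images[i];

    // Wrap the image so the caption sits directly below it
    var container = document.createElement('div');
    container.className = 'img-caption';
    img.parentNode.insertBefore(container, img);
    container.appendChild(img);

    // Reuse the image's title attribute as the visible caption text
    var caption = document.createElement('p');
    caption.textContent = img.getAttribute('title');
    container.appendChild(caption);
}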
Another bunch of these useful functions can be found in the core.js JavaScript library. Joomla.renderMessages performs all the dynamic rendering of errors, warnings and notifications. It expects an object which contains the messages, and the rest of the work of showing them is taken care of by the library itself. If not for this library, developers would have to write code in each and every place that needs messages rendered on the interface.
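
For example, rendering a warning and an error together takes nothing more than something like this (the message strings here are made up for illustration):

Joomla.renderMessages({
    warning: ['Please fill in all the required fields.'],
    error: ['The record could not be saved.']
});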
Likewise there are many other similar JavaScript libraries in the Joomla! code base adding tons of useful functionality. With all this complexity, to make sure these libraries perform as they should, testing inevitably becomes a serious requirement.
So let me tell you why it is crucial for Joomla! to have JavaScript tests covering its JavaScript libraries.
Following are some issues from the Joomla! issue tracker that could have been avoided, or solved with much less effort, had there been comprehensive test coverage of the JavaScript libraries.
One such issue, #10259, is about the validate.js JavaScript library, which handles front end validation for HTML forms. Even though the library is supposed to validate all form fields, including ones placed outside the relevant form tag, it stopped performing validation on them. The issue was introduced by the commit https://github.com/joomla/joomla-cms/commit/763c69f4bedc38f62ddf4f04af9d3d7fe3f3a719, which removed the MooTools dependency from that same library.
Now assume we had a comprehensive test suite covering the validate.js functionality. The moment the MooTools removal commit was made, Travis would have run the tests against the pull request and complained about the failing test cases. That way, the person who made the pull request would have identified the mistake and what needed to be done to fix it. If that had been the case, issue #10259 would never have shown up, and a lot of time and effort of multiple contributors could have been saved.
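
To give an idea of what such a guard looks like, here is a sketch of a Jasmine spec for that behavior. The fixture and assertions are illustrative, not the actual suite we wrote, but a spec of this shape would have gone red the moment the regression was introduced:

describe('validate.js', function () {
    beforeEach(function () {
        // A required field associated with the form but placed outside it
        document.body.innerHTML =
            '<form id="adminForm"></form>' +
            '<input form="adminForm" class="required" value="">';
    });

    it('still validates fields placed outside the form tag', function () {
        var form = document.getElementById('adminForm');
        // An empty required field must make validation fail
        expect(document.formvalidator.isValid(form)).toBe(false);
    });
});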
Every pull request in Joomla! needs to be tested successfully by at least two people before it can be merged. In most issues, it can be observed that multiple iterations of fixing and testing are needed before the issue gets completely fixed; https://issues.joomla.org/tracker/joomla-cms/9243 is an example of that. This happens for two reasons. One is the inability of a single person to think of and try out all the possible ways his or her fix could be tested. The other is the fact that there is a high possibility an outsider will notice something the developer missed. Still, this process consumes a significant amount of time, and sometimes the problems found are only the pretty straightforward ones. Having a comprehensive test suite would definitely cut down a considerable amount of this time.
As you can see, our JavaScript tests can make a significant impact on the Joomla! system as well as its community. These tests are so important to Joomla! that we had a Google Summer of Code project specifically focused on them! The good news is that the whole testing setup, as well as the comprehensive test suite, has already been merged into the Joomla! core and is in use now.

Monday, January 4, 2016

Going Cloud Native with Amazon Web Services: Tutorial on Amazon API Gateway, AWS Lambda and Amazon DynamoDB

The term 'Cloud Native Applications' has become a buzzword in present-day computer science and software engineering jargon. It basically refers to a drastic deviation from the classical application development and deployment architecture. The traditional approach for a web based application is to develop it from scratch locally and then deploy or host the whole application on cloud infrastructure. On the contrary, cloud native applications are built with a minimal amount of code of their own, mostly making use of separate cloud services that interact together to perform the domain specific tasks. Amazon Web Services provides a whole bunch of such services that allow us to do virtually anything we want in terms of application development.

Most applications tend to have a common architecture where a front end application talks to a RESTful API, which in turn accesses a database lying underneath to enter or retrieve data. In this blog post I will be focusing on how to get this kind of application architecture up and running using three Amazon Web Services: Amazon API Gateway, AWS Lambda and Amazon DynamoDB.

Amazon DynamoDB
DynamoDB is a scalable NoSQL database service provided by Amazon. Packed with a lot of additional features like alarms, indexes and triggers, it provides the basic data storage requirement of an application with fast, predictable performance and seamless scalability.

AWS Lambda

AWS Lambda is a lightweight compute service that allows us to execute a piece of code in response to some trigger like an API call or an event occurrence. A Lambda function can be written in any of the 3 languages Node.js, Java or Python (as of when this post was written). The most interesting thing about AWS Lambda is that we are only responsible for the piece of code we write; AWS handles all the infrastructure management aspects of it to provide us with optimum performance.

Amazon API Gateway

Amazon API Gateway is a service that allows you to create APIs in a matter of minutes without writing a single line of code! It has its own management console which depicts the API resources we have created hierarchically, with the methods associated with each of the endpoints. We can simply associate any endpoint with a Lambda function so that a call to that endpoint triggers the Lambda function to execute. API Gateway also allows creating an endpoint that acts as an HTTP proxy to another endpoint.


The objective of this blog post is to provide you with a clear step by step guide on using the above said 3 services together to come up with a simple web application that makes HTTP calls to an API to get some domain specific work done. Hang tight! It's gonna be a long journey!


This tutorial contains mainly 3 steps, namely
  1. Creating the database in DynamoDB
  2. Configuring a Lambda function to communicate with the DB
  3. Configuring the API Gateway to trigger that Lambda function upon an API call.



Before stepping into the first item on our list, let's get an overview of the architecture of the application that we will be building.




Just by the looks of it, it seems quite simple, and indeed it is. We will be creating a front end web application that makes HTTP requests to the API Gateway. The API Gateway triggers the respective Lambda function, which talks to the relevant DynamoDB table and returns the necessary data back to the API Gateway. Finally the API Gateway sends the response back to the front end web application based on the contract it has with the end user.

An important thing that needs to be mentioned here is the region selection available in AWS. AWS allows you to select the region your services are provisioned in. For example, if you use DynamoDB in the US East (N. Virginia) region, the actual infrastructure provisioning for you happens in that area. This way you get to choose the region closest to your target customers, reducing communication latency and improving the performance of your application. It also provides the added advantage of having disaster recovery sites. However, currently, some AWS services are supported only in some regions. For example, Lambda functions can currently be created only in the regions US East (N. Virginia), US West (Oregon), EU (Ireland) and Asia Pacific (Tokyo). Therefore you need to make sure to select one of those 4 regions when creating the DynamoDB table if you are to associate it with a Lambda function later on. It should be noted that for some services to integrate together (like Lambda and DynamoDB), they must be created in the same region, though this does not apply to each and every Amazon service.


There is no particular order for following the above said 3 steps. But just for the sake of clarity, I will follow the order in which they were mentioned and start with the first step.


Log in to your AWS account and under the Database category you will find the DynamoDB service. Click on it.


What you will be seeing next is the DynamoDB Dashboard which basically gives you some statistics (which probably won’t make much sense to you if you are here for the first time; which is perfectly alright!). Click on the Create table button to begin creating a table.


A bit of the domain stuff. The application we will be building is a simple employee directory that allows saving an employee’s details on a DynamoDB table and retrieving them back upon request. So for that we will create a table with the name Employee.

Going into a bit of detail, the primary key in DynamoDB can either be just a partition key, or a combination of a partition key and a sort key. The partition key is used as input to an internal hash function whose output determines the partition the item will be stored in. If only the partition key makes up the primary key, every record needs to have a unique value for it. The second type, a composite primary key, uses the partition key as usual to decide where a record is stored, while the sort key is used to keep items sharing the same partition key stored in sorted order. In this scenario, it is possible for two items to have the same partition key value, but those two items must have different sort key values. This primary key concept makes DynamoDB a deviation from document databases like MongoDB, which have no such requirement.


Coming back to the table creation, let us set the partition key to 'id' and select its type from the drop down list as Number. To keep things simple, let us not go for a sort key. Under the Table settings area, keep the 'Use default settings' check box ticked, since we do not need advanced configuration for the moment. Once all that is done, you are good to create the table.
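
If you prefer code over console clicks, the same table can also be created through the AWS SDK for JavaScript; here is a sketch (it assumes the aws-sdk npm package is installed and your credentials are configured, and uses the region I am working in):

var AWS = require('aws-sdk');
var dynamodb = new AWS.DynamoDB({ region: 'eu-west-1' });

dynamodb.createTable({
    TableName: 'Employee',
    AttributeDefinitions: [{ AttributeName: 'id', AttributeType: 'N' }],
    KeySchema: [{ AttributeName: 'id', KeyType: 'HASH' }], // partition key only
    ProvisionedThroughput: { ReadCapacityUnits: 5, WriteCapacityUnits: 5 }
}, function (err) {
    if (err) console.error('Table creation failed:', err);
    else console.log('Employee table created');
});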

Next, we will add some sample data to the newly created table. It will come in handy when testing the Lambda function that we will be using to retrieve data from the DB. For that, go to the Items tab and click on Create item. In the dialog box that appears, specify a unique id value, add 2 more fields 'age' and 'name' with types Number and String respectively, and click Save.


You can create another record following the same steps.
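
Again, if scripting this is more your thing, a sample item can be put in with the SDK's DocumentClient as well (the name and age values here are just examples):

var AWS = require('aws-sdk');
var docClient = new AWS.DynamoDB.DocumentClient({ region: 'eu-west-1' });

docClient.put({
    TableName: 'Employee',
    Item: { id: 1, name: 'John Smith', age: 30 } // illustrative values
}, function (err) {
    if (err) console.error('Unable to add the item:', err);
    else console.log('Added employee 1');
});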


We have now completed the first step in our 3 step process. The database is set up with a table having sample data.


Next, we move on to working with Lambda functions. For that, click on the Services menu in the navigation bar and, under Compute, select Lambda. If the selected region does not contain any Lambda functions created earlier, you will get a home screen where you have to click on Get Started Now. If the selected region already has some functions, you will see the list of those existing functions.
To start with, click on Create a Lambda Function.

The first step of creating a Lambda function prompts us to select a predefined blueprint that would assist us in writing the Lambda function's code. You can select the blueprint that suits your requirement, or skip this step if you want to write the code your own way from scratch. For this tutorial I will be providing you with a custom code snippet, so we will skip this step.

The second step deals with configuration. You basically have to enter a name and a description. A best practice in naming a Lambda function is to name it with some correspondence to the URL of the API endpoint that we hope to associate it with. In our case I plan on creating an endpoint employee/{employee-id} to call this Lambda function and return the information of the passed in employee id, so I will name this Lambda function 'employee-employee-id'. The Runtime allows you to choose which language you are writing the code in. I have written my code in JavaScript, so I will be choosing Node.js. Paste the following code in the editor under Lambda function code.


var doc = require('dynamodb-doc');
var dynamodb = new doc.DynamoDB();

exports.handler = function(event, context) {
    console.log("The context variable contents: " + JSON.stringify(context));

    // The id arrives as a string (set by the API Gateway mapping template),
    // so parse it into a number to match the table's key type
    var id = parseInt(event.id, 10);

    var params = {
        TableName: 'Employee',
        Key: {
            'id': id
        }
    };

    // Fetch the employee record and hand the result back to the caller
    dynamodb.getItem(params, function(err, data) {
        if (err) {
            context.fail(new Error(err));
        } else {
            context.succeed(data);
        }
    });
};


The first and second lines of the above code make it possible for the function to use the DynamoDB API.

exports.handler = function(event, context) {};

is more or less boilerplate you will use in almost every scenario involving Lambda. The code to be executed goes inside this function. The 'event' and 'context' variables carry some valuable data into the function scope. The event variable will hold any values sent in by the API Gateway (using mapping templates). The context variable will contain the context information (yeah, obviously!). You will be able to see its exact contents once we create the function and test it.

Under Lambda function handler and role, let the handler keep its default value index.handler, and for the Role, something like lambda_dynamo should be selected. If there is no such role under Use existing role, you will have to create one with permission to use DynamoDB.

The advanced settings allow control over the resources allocated to the code and the maximum time a function call can run. We do not need to pay attention to these for the moment; they can be left at their default values. Once you click Next, you will see a review page with the Create function button.


Once the function has been created, AWS Lambda has a cool feature that allows us to test its execution. Before we test, we need a way to specify the input values that will come into this function via the 'event' variable. For that, click on the 'Actions' button and select 'Configure test event'. The editor that comes up has a sample object with sample key value pairs, which is the object that will be passed to the event variable when the test is run. With that said, you might have already figured out what needs to go there. In case you didn't, here it is.


{
 "id": "1"
}


Once that is in place, we are good to go ahead and test our function. Click on the 'Save and test' button; the function will run, parse the specified id string (which in this case was '1') into an integer and query the database to retrieve the relevant record. The results are shown just below the code editor and, if you have followed the tutorial right, you should be seeing something like




You can also see the Log output on the right hand side which has the contents of the context variable printed in it.


One quick bonus tip: Lambda functions can be triggered in many ways to get different types of work done. One interesting usage is a scheduled function that runs periodically. This can be configured by going to the Event sources tab and clicking on Add event source. There, as the Event source type, select Scheduled event and give the necessary configuration. This is not relevant to the application we are working on, so I will not go into further detail.


Up to now we have created the necessary DynamoDB table and configured and tested a Lambda function that talks to the DB to return the record of a passed in id value. This marks the end of the 2nd step in our 3 step process, and we now move to the final step, which is to bring API Gateway into the game.


Go to the API Gateway service and create a new API. Give a suitable API name and a description of your choice and click Create API.




The API is going to have resources and methods, and you will see what they mean in a moment. If you remember the naming convention I used for the Lambda function, I am expecting an endpoint like employee/{employee-id} to pass in an id value and get the relevant employee's details in return. For that we will have to create 2 resources. Click on Create Resource, give the Resource Name as employee and click Create Resource. Now click on the newly created employee resource in the hierarchy pane on the left hand side and click Create Resource again. This time it is going to be the id parameter passed in as part of the URL path. So type employee-id in the Resource Name field, and in the Resource Path field modify the value to {employee-id} by adding curly brackets around it.


Finally click on Create Resource, and we are now done creating resources. The next step is to create a GET method on this endpoint, because that is what we expect from it. If you are not familiar with method types like GET, POST and PUT and what they are used for, I would recommend you google around and see for yourself.
To create a method, click on the /{employee-id} resource and click Create Method. In the drop down list that appears, select GET and click on the check mark. What comes next is the configuration setup for our newly created method. Since we are going to incorporate this with our Lambda function, we will select the Integration type as Lambda function (yeah, obviously). And in the Lambda Region, select the region in which you created the Lambda function. Once the region is selected, enter the name of the Lambda function that we created, which is 'employee-employee-id'. Finally click Save and click OK to give permission for this endpoint to invoke our Lambda function.




The next screen that comes up is going to give you a nice overview of how API Gateway handles the incoming requests and sends back the responses.




Digging into some detail, the Method Request, Integration Request, Integration Response and Method Response can be considered the 4 stages of a request response cycle in API Gateway. The Method Request allows you to define the contract you have with the end user in terms of requests. The Integration Request gives you the chance to modify the incoming request to suit whatever interface you have in the Lambda function; for instance, that is where we specify the mapping template for passing the path parameter into the Lambda function. In the Integration Response you again get a chance to modify the output coming from the Lambda function to match the interface you have with the end user. For example, if the Lambda function throws an error, this is where you get the chance to map that error to a status code to be sent back to the end user in the response. Finally, the Method Response stage is where you define those codes and the contract between the API and the end user with respect to responses. This is just a quick overview of what each of those stages means; for the application we are building, we only need to go inside the Integration Request.


Click on Integration Request and, in the screen that appears, click on Mapping Templates. Click Add mapping template, type application/json in the text field that appears and click on the tick mark. Once that is done, on the right hand side click on the pencil icon next to Input passthrough and select Mapping template from the drop down list. Type the following text in the editor that comes up.

{
   "id": "$input.params('employee-id')"
}

This is where we tell API Gateway, "hey, please pass this object to the Lambda function with an attribute id assigned the value that comes in as the 'employee-id' parameter in the URL path". This $input is a variable that API Gateway provides to give us access to the contents of the incoming request. There are several other variables like $context, $stageVariables and $util, and complete documentation on how they can be put to use is available in the API Gateway Mapping Template Reference.
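
For example, if our Lambda function also needed to know the caller's IP address, the template could forward it from $context as well (illustrative only; our application needs just the id):

{
    "id": "$input.params('employee-id')",
    "sourceIp": "$context.identity.sourceIp"
}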

Once the template is specified in the editor, do not forget to click on the tick button next to the drop down list where you selected Mapping Template to save the changes.




We are almost done with our configuration, and it's time to test our API endpoint. Once again, API Gateway has a cool feature that allows us to specify any parameters in the endpoint and trigger an API call to see if the correct results come back. For that, click on the Method Execution button next to Deploy API and click on TEST in the Client box. Next, specify a value for the employee-id parameter and click the Test button.




If you got the above screen, then congratulations! You just created a working endpoint that serves a GET request.


But it's not over yet. For you to use this API and expose it to the world, you have to deploy it. For that, click on Deploy API and select New Stage as the Deployment stage if you don't have any stages created before. Enter values of your choice for the rest of the fields and click Deploy.




On the very next screen you get to see the invoke URL for your very first API along with many other configuration options which I won’t be paying attention to in this post.




The invoke URL for the API I created is https://ubq7zz9sml.execute-api.eu-west-1.amazonaws.com/alpha, so navigating to https://ubq7zz9sml.execute-api.eu-west-1.amazonaws.com/alpha/employee/1 should return the record of the employee with the ID 1.


As expected it does.


There is one small thing that might be important to you if you are going to use this API endpoint from an application running locally or on another server. Even though the GET request we send by navigating to the URL in the browser works fine here, if you try to do the same from a script in your application via an asynchronous HTTP GET request, you are going to be in a bit of trouble.


XMLHttpRequest cannot load https://ubq7zz9sml.execute-api.eu-west-1.amazonaws.com/alpha/employee/1. No 'Access-Control-Allow-Origin' header is present on the requested resource. Origin 'http://localhost:4000' is therefore not allowed access.


In case you are feeling too lazy to try that out on your own, download this small AngularJS demo web application that runs on a simple Node server (you will need to have Node.js installed on your machine to run it; if you don't, download it from the Node.js web site and install it).

All you need to do is go to public\controller\controller.js and assign the $scope.domainName variable the invoke URL of YOUR API deployment. Then run the Node server, which will listen for incoming connections on port 4000. Go to http://localhost:4000/, enter a value for the ID, hit Go, and if you bring up the console in your browser's developer tools, you will definitely see this error.
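
For the curious, the call the demo app makes looks roughly like this (a sketch of an AngularJS $http request; the actual controller code may differ slightly). It is this cross-origin request that the browser refuses to complete:

$http.get($scope.domainName + '/employee/' + $scope.employeeId)
    .then(function (response) {
        $scope.employee = response.data; // never reached at this point
    }, function (error) {
        console.error('Request failed', error);
    });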


In case this is the first time you have seen this error: browsers do not allow cross domain asynchronous HTTP requests by default, in order to enforce the same-origin policy and prevent a class of cross-site attacks. The only way such HTTP requests are allowed is for the target domain to set the Access-Control-Allow-Origin header on its response. If you look at the response headers in our case, you can see that this header is not present in the response.




To fix this we need to go back to API Gateway and click on the {employee-id} resource. In the right hand pane, click the Enable CORS button; CORS stands for Cross-Origin Resource Sharing.




There you get to select the methods you need to enable CORS for; all of them are selected by default. In the Access-Control-Allow-Origin* field you get to specify the domains that can access this resource. The '*' input there by default is a wildcard that allows any origin to access the resource. For our demo application these default settings are fine, so we can click on the 'Enable CORS and replace existing CORS headers' button. Click Yes when prompted for confirmation. Finally, to make your changes take effect, click on Deploy API again, select the same stage that you created earlier and click Deploy.


Now when you go to our demo application, enter an id and hit Go, you should see the details of that employee, like this.


And if you go through the response headers this time, as expected, you will notice the Access-Control-Allow-Origin header present in the response, set to the wildcard '*'.




This is all I planned on sharing with you in this blog post. I should also mention that every bit of the knowledge I have shared here comes from the experience I gained working at 99X Technology as an intern.

Up next will be a tutorial on how to use the Amazon Cognito service for authenticating users in a web application. Do stay tuned!