After Prague…May!

Hi! After a month and a half, quite a few things have happened… I am going to summarize them a bit by writing the “Highlights”!

Reading (pi = personal interest)

Reviewing

Writing

Coding and Tools

  • I have started to design a taxonomy that collects the main KPIs from the CSMIC initiative and other existing work in order to create the Cloudindex. Some initial outcomes can be checked here (it is very similar to my work on the Webindex; however, the computation process still has to be added and the URIs are not yet accessible):
  • …I plan to report this on-going work to the CSMIC initiative!
  • I have also been testing Cloudera's Impala tool for real-time queries.
  • I would also like to highlight the excellent work of the Media Lab Team at the University of Athens in creating the “PublicSpending.net” portal. My contribution has been small but I hope to contribute more in the future.

Teaching

  • During my stay in Spain in the Easter holidays (first week of May) I was invited by Jose Emilio Labra to give some seminars in the Semantic Web course of the Master in Web Engineering. I am very grateful for this opportunity and I hope I can do it again. I presented part of my PhD and my current research at SEERC. I think the students were more or less happy with the contents! (My last lecture had been in January, so I was a little out of practice.)
  • Finally I have also collaborated in reviewing and partially supervising a final degree project on sentiment analysis. It has been a great experience!

Other things

In Spain I was able to meet my family and friends, visit places such as Tazones, go out and, mainly, drink and “escanciar” “Sidra Asturiana” and eat in places such as Tierra Astur or Los Lagos…and more things that you should ask me about! 🙂

Map/Reduce intro

I have been playing with different Map/Reduce tools and examples to get an overall view of how this programming model can be used to solve different problems. Basically, there are several patterns that can be identified when you need to implement an algorithm. I would like to leave some references that have served as inspiration:

In the same way I have prepared some slides compiling some basic examples that I have adapted from the original sources to deliver my own tutorial. I firmly believe that the best way of learning something is to try to explain it to others, so I hope to extend these slides in the future to cover most of the patterns with different examples. For the moment I am going to leave the presentation here:

Now it is time to play with Impala-Cloudera…I will keep you informed of the progress!

Week #12 and #13. The MSTC in Prague

Last week I was preparing my presentation for the RELATE-MSTC (Multi Skills Training Camp) in Prague. I was very excited because it was going to be the first time I would meet my colleagues in the RELATE project. Now I can definitely say that the experience is being amazing! We have attended talks with different scopes: presentation skills, parallel programming, etc. We have also attended each other's presentations: each fellow was in charge of presenting their results, research questions and so on, followed by a discussion with the others. This kind of collaborative meeting is very fruitful and I think I have improved my knowledge of some cloud computing issues. I have also presented my progress on quality of service and I hope to do my best in the next camp. Apart from that, we are now attending the ICPE 2013 conference, more specifically the Hot Topics workshop!

Let me leave my presentation (position) here:

I will continue posting following the previous structure…

Papers to read

Reading on-going:

MapReduce Design patterns

Over the last two weeks I have been preparing an introduction to the Map/Reduce programming model. I have found a lot of excellent new references (see my previous post) and I have read some books that I did not know. As a result I have made a compilation (it is just a summary that I will continue updating) that can serve as a roadmap of what MapReduce is and what you can do with this programming model. As soon as I review my presentation and the examples I will upload them.
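
To give an idea of what the programming model looks like, here is a minimal sketch of the classic word count written in plain Python, with no Hadoop involved. The function names (map_phase, shuffle, reduce_phase, word_count) are my own illustrative choices and not taken from any book or framework; in a real cluster the shuffle step is done by the framework itself.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document):
    """Mapper: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by key, as a framework would do."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reducer: sum the counts collected for one word."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle and reduce in sequence over a list of documents."""
    mapped = chain.from_iterable(map_phase(doc) for doc in documents)
    return dict(reduce_phase(key, values) for key, values in shuffle(mapped).items())


if __name__ == "__main__":
    print(word_count(["to be or not to be", "to do"]))
```

Most of the design patterns (filtering, summarization, joins, etc.) come down to choosing what the mapper emits as the key and what the reducer does with the grouped values.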

Week #10 and #11 in Thessaloniki

Hi! I have been a little bit lazy about writing on the blog, but it is now time to get back to good habits. I am going to summarize my tasks during the last two weeks.

Reading (pi = personal interest)

Writing, reviewing and researching

  • I am managing a Special Issue in the Journal of Computers in Industry, Elsevier.
  • I finished the review of a book for Manning Publications.
  • I have been included as Technical Development Editor in Manning Publications.
  • I have been included as PC member in the workshop proposal “Data Mining on Linked Data (DMoLD’13) workshop with Linked Data Mining Challenge”  thanks to my colleagues at the University of Economics in Prague.
  • I am reviewing a paper for the journal “Expert Systems with Applications” (IF: 2.203)
  • I am reviewing and finishing the paper with my colleague Alejandro Montes about his final Master Project.

Meetings

  • I have had a meeting with my SEERC colleagues to talk about next actions.
  • I have had a meeting with Michalis Vafopoulos to prepare the Linked Data Cup paper.
  • I have had two meetings with Lum about his Bachelor Degree Project. I am acting as a kind of supervisor; the project addresses the problem of sentiment analysis using Rapidminer, Lingpipe, Alchemy API and a custom solution.

Coding and Tools

  • In my leisure time I have built a tool for unifying company names, called CORFU, using Python, NLTK and the APIs of Google Places, Linkedin and Google Suggestions. It also includes other algorithms based on string similarity, etc.
  • I have developed a simple sentiment analyzer using Alchemy API.
  • I have adapted some examples of Map Reduce patterns.
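
Just to illustrate the string-similarity idea behind unifying company names, here is a small sketch that uses only the Python standard library. It is not the CORFU code: the suffix list, the 0.85 threshold and the function names are assumptions I make for the example, and the external APIs are left out completely.

```python
import difflib
import re

# Hypothetical list of legal-form suffixes to ignore; a real list would be longer.
LEGAL_SUFFIXES = {"inc", "ltd", "llc", "corp", "corporation", "co", "sa", "gmbh", "plc"}


def normalize(name):
    """Lowercase, replace punctuation with spaces and drop legal-form suffixes."""
    tokens = re.sub(r"[^\w\s]", " ", name.lower()).split()
    return " ".join(token for token in tokens if token not in LEGAL_SUFFIXES)


def similarity(a, b):
    """Similarity ratio in [0, 1] between the normalized forms of two names."""
    return difflib.SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def unify(names, threshold=0.85):
    """Greedy clustering: map each raw name to the first earlier name it resembles."""
    canonical = []
    mapping = {}
    for name in names:
        match = next((c for c in canonical if similarity(name, c) >= threshold), None)
        if match is None:
            canonical.append(name)
            match = name
        mapping[name] = match
    return mapping


if __name__ == "__main__":
    for raw, unified in unify(["Oracle Corp.", "ORACLE Corporation", "Microsoft Inc"]).items():
        print(raw, "->", unified)
```

The greedy approach depends on the order of the input and compares every name with every canonical name, so it is only reasonable for small lists.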

Other things

  • I continue my fight to learn Greek…I have to study a bit more!
This is all I can remember…perhaps I am missing something…!

Week #9 in Thessaloniki

Just a few comments for this week (to be completed)….

Reading (pi = personal interest)

Writing, reviewing and researching

  • I am finishing the book chapter about publishing statistical data in RDF
  • I am managing a Special Issue in the Journal of Computers in Industry, Elsevier
  • I just realized that Labra added me to the Acknowledgements of his work on “Multilingual Open Data Patterns”. I am very proud of that! (To be honest, I just collaborated in the first presentation with some links and especially through some comments regarding SKOS-XL.) I also recommend reading the paper, in which each of the patterns is explained and discussed with excellent examples.

Meetings

  • I have had a meeting with my SEERC colleagues to present my prototype and plan next actions in QoS, etc.
  • I have had a meeting with Michalis Vafopoulos to prepare the Linked Data Cup paper.

Coding and Tools

  • I have implemented a real-time architecture using the Lambda approach and following some hints from Pere Ferrera. It is not the same algorithm: I am just taking the approach to tackle the problem, not the source code. The next steps include using RDF as views for the batch and real-time layers through SPARQL federated queries (for instance with FedX). The example just takes a Twitter stream using the Twitter4J API and counts words, presenting the results in an HTML page. Documentation is available here and so is the source code (under development).
  • I have linked the public procurement notices from the UK, the USA and Australia to the CPV (Common Procurement Vocabulary).
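
The Lambda approach mentioned above can be sketched in a few lines of Python. This is only a toy, in-memory illustration under my own assumptions (the class and method names are hypothetical, and there is no Twitter stream, Hadoop or Storm behind it): a batch view is recomputed from the whole master dataset, a real-time view covers what has arrived since the last batch run, and queries merge both.

```python
from collections import Counter


class LambdaWordCounter:
    """Toy Lambda-style word counter with a batch view and a real-time view."""

    def __init__(self):
        self.master = []                # append-only master dataset (all messages)
        self.batch_view = Counter()     # recomputed from scratch by the batch layer
        self.realtime_view = Counter()  # incremental counts since the last batch run

    def ingest(self, message):
        """New data goes both to the master dataset and to the speed layer."""
        self.master.append(message)
        self.realtime_view.update(message.lower().split())

    def run_batch(self):
        """Batch layer: recompute the view over the whole master dataset."""
        view = Counter()
        for message in self.master:
            view.update(message.lower().split())
        self.batch_view = view
        # Everything ingested so far is now covered by the batch view.
        self.realtime_view = Counter()

    def query(self, word):
        """Serving layer: merge the batch and real-time views."""
        return self.batch_view[word] + self.realtime_view[word]


if __name__ == "__main__":
    counter = LambdaWordCounter()
    counter.ingest("hello world")
    counter.run_batch()
    counter.ingest("hello again")
    print(counter.query("hello"))
```

In a real deployment the batch recomputation runs while new data keeps arriving, so the real-time view cannot simply be reset as it is here; the sketch ignores that overlap on purpose.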

Other things

Week #8 in Thessaloniki

Just a few comments for this week (to be completed)….

Reading (pi = personal interest)

Writing, reviewing and researching

Meetings

  • I have had a meeting with my SEERC colleagues to present my prototype and plan next actions in QoS, etc.
  • I have attended a call with the Oviedo Council (planned)

Coding and Tools

  • I have re-implemented my real-time word counter for Twitter statuses using RabbitMQ, Node.js, Storm and Redis (see Demo wiki page).
  • I am going to test the MOA framework (based on Weka) for mining data streams. In this sense I have also found an implementation of MOA in Storm.

Other things

I have Internet connection at home!

Week #7 in Thessaloniki

Just a few comments for this week (to be completed)….

Reading (pi = personal interest)

Writing, reviewing and researching

  • I am reviewing a paper for a Special Issue of a JCR journal
  • I am finishing the book chapter about publishing statistical data in RDF
  • I have also made the first review of WESOMENDER (we have to work hard to get a good contribution but the expectations are high)
  • I have been invited to be part of the PC of the Special Session “Engineering Tool Integration for Industrial Automation System Development (ETAS 2013)” in conjunction with IECON2013
  • I have joined the research group “Comercio Electronico en Colombia – GICOECOL” thanks to Luz Andrea RODRIGUEZ ROJAS, with whom I will collaborate to empower the use of Open Data in e-Health.

Meetings

Coding and Tools

I have implemented a real-time word counter for Twitter statuses using different techniques:

  • The classical Observer design pattern
  • The Storm framework: I have reused some examples to implement my own spouts and bolts
  • The Trident framework on top of Storm: I have also reused some examples from the storm-starter project, customizing the code to get a better understanding
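
As an illustration of the first technique, a word counter based on the Observer pattern could look like the following sketch. It is written in Python with hypothetical class names (StatusStream, WordCounter) and it uses a hard-coded list of statuses instead of a real Twitter stream, so it is not the code of my prototype.

```python
from collections import Counter


class StatusStream:
    """Subject: keeps a list of observers and notifies them of each new status."""

    def __init__(self):
        self._observers = []

    def subscribe(self, observer):
        self._observers.append(observer)

    def publish(self, status):
        for observer in self._observers:
            observer.update(status)


class WordCounter:
    """Observer: keeps a running count of the words seen so far."""

    def __init__(self):
        self.counts = Counter()

    def update(self, status):
        self.counts.update(status.lower().split())


if __name__ == "__main__":
    stream = StatusStream()
    counter = WordCounter()
    stream.subscribe(counter)
    for status in ["Storm is fast", "storm and Trident"]:
        stream.publish(status)
    print(counter.counts.most_common(3))
```

In Storm the same roles are split differently: a spout plays the part of the stream and the counting logic moves into a bolt, which is what allows the work to be distributed.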

Other things

This week I have started the 3-month Greek course and I am very happy because I can now understand some words and read a little bit 🙂 Besides, my classmates come from a lot of countries: Bulgaria, Germany, Bosnia, France, Serbia, New Zealand, Italy, Moldova and Russia. It is a GREAT experience.

 

Week #6 in Thessaloniki

Just a few comments for this week….

Reading

Writing and reviewing

  • I have written a paper entitled “Semantic-based QoS management in Cloud Systems: Current Status and Future Challenges”. It is a 25-page survey in which I review most of the approaches for QoS in Cloud Computing as well as the main techniques for stream reasoning in the Semantic Web area. It should be the kick-off of the architecture for semantic-based QoS management in Cloud Systems, and I also have the acronym: “SEQoS” (remember it because you will hear about it!)
  • I have also managed tasks regarding the special issue in COMIND.

Coding and Tools

I have not made any relevant progress in development tasks.

Week #5 in Thessaloniki

Last week I focused on two main tasks: my presentation at the City College and the submission of a paper. Following the same structure as in previous weeks, I leave some links to the activities I am carrying out:

Reading

Writing and reviewing

  • I have continued with the structure and first contents of two papers and one special issue proposal.
  • I have managed all the abstracts for the COMIND Special Issue.
  • I have submitted a paper to “Computers in Human Behavior”
  • I have given the presentation mentioned in the next bullet to the Department of Computer Science at City College
  • I have revised my previous presentation about MOLDEAS; the new version is meant to be more didactic, more like an “Intro” to Linked Open Data

Coding and Tools

I have not made any relevant progress in development tasks.

 

Week #4 in Thessaloniki

This week I have done some interesting tasks. Following the same structure as in previous weeks, I leave some links to the activities I am carrying out:

Reading

Writing and reviewing

  • I have continued with the structure and first contents of two papers and one special issue proposal.
  • I have submitted a contribution to the European Data Forum 2013.
  • I have prepared a presentation (I have to make some changes…)

Coding and Tools

I have not made any relevant progress in development tasks but I have been refreshing my Python know-how.

 

Week #3 in Thessaloniki

This week I will be updating this post because I am reading a lot of papers and I need a way to track them. Following the same structure as in previous weeks, I leave some links to the activities I am carrying out:

Reading

I have focused on some interesting subjects: statistics (Bayesian networks), data streams, feedback control loops, autonomic computing and e-Learning systems (this last one is just for personal interest). I have started, and finished, the following list of papers and books:

Writing and reviewing

I would like to leave the link to an article about “How to review a paper”, an excellent guide for evaluating your reviews and keeping in mind your responsibilities as a reviewer.

Coding and Tools

I have not made any relevant progress in development tasks but I have been refreshing my R know-how.

Teaching

I have finished the evaluation of the students in Health Information Systems and I am very proud of the marks and the work carried out by the students during the last months. I have some links to their work building mashups, but I prefer not to leave the links here due to privacy issues.