Toolkit for large scale studies of web documents
Download
1 / 17

ToolKit for Large-SCAle studies of Web documents - PowerPoint PPT Presentation


  • 109 Views
  • Uploaded on

ToolKit for Large-SCAle studies of Web documents. SI4 June Project Imane BELLAT – Bastien BLANCHARD Clément TOCHE – Maurice YARED. WorkFlow with Modularity. Initial WorkFlow. More Complex WorkFlow. Agenda. Project Subject. Problems. Outlooks. Some Tests.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about ' ToolKit for Large-SCAle studies of Web documents' - kaspar


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Toolkit for large scale studies of web documents

ToolKit for Large-SCAle studies of Web documents

SI4 June Project

Imane BELLAT – Bastien BLANCHARD

Clément TOCHE – Maurice YARED


Accessibility multiple paradigms

WorkFlow with Modularity

Initial WorkFlow

More Complex WorkFlow

Agenda

Project Subject

Problems

Outlooks

Some Tests

  • Web documents evaluation process

    • Co-operating services

  • Evaluations and Requirements

    • Web Accessibility Initiative (WAI)

    • Checkpoints from the Web Content Accessibility Guidelines

  • Additional Services

    • Page Rank

    • W3C Validation

    • PrettyPrint of statistics

Accessibility : multiple paradigms


Scalability tools

WorkFlow with Modularity

Initial WorkFlow

More Complex WorkFlow

Agenda

Project Subject

Problems

Outlooks

Some Tests

  • Grid Workflow Efficient Enactment for Data Intensive Applications (Gwendia)

    • Workflow description framework

    • Grid Infrastructure

    • Optimizingdistributed computation

    • MOTEUR as WorkFlow Manager

  • Goal

    • Accessibility tests on elements in HTML, CSS, Script code

Scalability Tools


Agenda

WorkFlow with Modularity

Initial WorkFlow

More Complex WorkFlow

Agenda

Project Subject

Problems

Outlooks

Some Tests

  • Project Subject

  • Initial WorkFlow

    • Vision of Architecture

    • Code Implementation

    • WorkFlow Structure

  • WorkFlow WithModularity

    • Concept

    • WorkFlow Structure

  • More Complex WorkFlow

    • Vision of Architecture

    • WorkFlow Structure

  • Some Tests

  • Problems

  • Outlooks

Agenda


WorkFlow with Modularity

Initial WorkFlow

More Complex WorkFlow

Agenda

Project Subject

Problems

Outlooks

Some Tests

Vision of Architecture


WorkFlow with Modularity

Initial WorkFlow

More Complex WorkFlow

Agenda

Project Subject

Problems

Outlooks

Some Tests

Code Implementation

  • Subdivision of WorkFlow in Processors

  • Encoding processors

    • BeanShell

      • Internal MOTEUR Java Code

      • Directlydeployed on the grid

    • Common WebServices

      • Accessible through Web Service Description Language (WSDL)


WorkFlow Structure

Data Entrance

Loading Data

RetrievingElements

Processing data

Processing results


WorkFlow with Modularity

Initial WorkFlow

More Complex WorkFlow

Agenda

Project Subject

Problems

Outlooks

Some Tests

  • Meeting the needs of developers wishing to expand treatments

  • Communication constraints

  • Modularity :

    • By Layers

    • By Processors

Concept of Modularity

Contract Models Implementation


WorkFlow with Modularity

Initial WorkFlow

More Complex WorkFlow

Agenda

Project Subject

Problems

Outlooks

Some Tests

Data flow

Data contract


WorkFlow Structure

Data Entrance

Loading data

Retrieve elts

Retrieve elts 2

Processing data

Processing results


WorkFlow with Modularity

Initial WorkFlow

More Complex WorkFlow

Agenda

Project Subject

Problems

Outlooks

Some Tests

Vision of Architecture


WorkFlow Structure

Data Entrance

Domain Loader

Loading data

Retrieve elts

Retrieve elts 2

Processing results

Processing data


WorkFlow with Modularity

Initial WorkFlow

More Complex WorkFlow

Agenda

Project Subject

Problems

Outlooks

Some Tests

Pretty Printed Statistics


Problems encountered

WorkFlow with Modularity

Initial WorkFlow

More Complex WorkFlow

Agenda

Project Subject

Problems

Outlooks

Some Tests

  • MOTEUR has no final release version

  • MOTEUR Simulation use Common JVM Allocated Memory

    • Insufficient for large scale data flows

  • WorkFlow GraphicModeling not Malleable

  • VeryUselessDebugging Platform

Problems Encountered


Some improvements

WorkFlow with Modularity

Initial WorkFlow

More Complex WorkFlow

Agenda

Project Subject

Problems

Outlooks

Some Tests

  • Implementation of all WAI checkpoints tests

    • With CSS parsing

    • Dynamic scripts parsing

  • Implementation of GlassFish WebServices in JGASW provider

  • Implementation of Security Manager Modules to WorkFlow

  • Defining automation for new processors implementation

  • Conditioning every modification of the MOTEUR Workflow according to contracts

  • MOTEUR modeling for neural networks

Some improvements


WorkFlow with Modularity

Initial WorkFlow

More Complex WorkFlow

Agenda

Project Subject

Problems

Outlooks

Some Tests

  • What we have learnt :

    • WorkFlow modeling

    • Team working and project planning

    • Facing and resolving unplanned issues

    • Working on a research-oriented project

    • Getting along with additional requests from a client


Thanks for your attention

"The power of the Web is in its universality. Access by everyone regardless of disability is an essential aspect.“

Tim Berners-Lee, W3C Director and inventor of the World Wide Web

Thanks for your attention


ad