Skip to content

zhenyangze/Hawk

 
 

Repository files navigation

Hawk: Advanced ETL & Crawler Stream Engine


Welcome to use Hawk! Hawk can graphically crawle webpage, clean, process and save data without programming, written in C#/WPF, open source according to the GPL protocol.

Introduction

The meaning of Hawk is "Eagle", which can kill prey efficiently and accurately. Its design idea comes from Lisp language , and its function mimics awk in Unix.

The key features are as follows:

  • Intelligent analysis of web content without programming.
  • WYSIWYG, conversion, filtering & storage with visually drag and drop.
  • Parallel processing & high speed
  • Support mutli-file & database: xml, csv, sqlite,mongodb...
  • Tasks can be save, pause, restart & reused.
  • Focus on crawler but power is far beyond that.

splash

Fast & Smart webpage crawling :

![2.gif-3330.9kB] (http://static.zybuluo.com/buptzym/10kykg6qhqvsabbq8yj32pt0/2.gif)

WYSIWYG ETL:

![1.gif-1001.8kB] (http://static.zybuluo.com/buptzym/qkl0vavjn6cj007qfk2k3gqg/1.gif)

Upgrade History

  • Hawk3.5

  • Hawk3

  • Hawk2.1

  • Hawk2

  • Hawk

About

visualized crawler & ETL IDE written with C#/WPF

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • C# 100.0%